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‘We do not want to calculate, 
we want to reveal structures.’ 


- David Hilbert, 1930 - 


This book is dedicated to my teacher, Rolf Reiss. 


... you were the one who had made it so clear... 


George Harrison, All Those Years Ago 


Preface 


Multivariate extreme value theory (MEVT) is the appropriate toolbox for 
analyzing several extremal events simultaneously. However, MEVT is by no 
means easy to access; its key results are formulated in a measure-theoretic 
setup in which a common thread is not visible. 

Writing the ‘angular measure’ in MEVT in terms of a random vector, how- 
ever, provides the missing common thread: every result in MEVT, every rel- 
evant probability distribution, be it a max-stable one or a generalized Pareto 
distribution, every relevant copula, every tail dependence coefficient, etc., can 
be formulated using a particular kind of norm on the multivariate Euclidean 
space, called a D-norm. Deep results, such as Takahashi’s characterizations 
of multivariate max-stable distributions with independent or completely de- 
pendent margins, turn out to be elementary and easily seen properties of 
D-norms. 

Norms are introduced during each basic course on mathematics as soon as 
the multivariate Euclidean space is introduced. The definition of an arbitrary 
D-norm requires only the additional knowledge of random variables and their 
expectations. But D-norms not only constitute the common thread through 
MEVT, they are also of mathematical interest on their own. 

D-norms were first mentioned in Falk et al. (2004, equation (4.25)) and 
elaborated on in Falk et al. (2011, Section 4.4). However, it was recognized 
only later that D-norms are actually the skeleton of MEVT and that they 
simultaneously provide a mathematical topic that can be studied indepen- 
dently. This book fills that gap by compiling contemporary knowledge about 
D-norms and, simultaneously, offers a relaxed tour through the essentials of 
MEVT due to the D-norm approach. 

In Chapter 1, we introduce the theory of D-norms in detail and com- 
pile contemporary knowledge. Chapter 2 presents the first links with MEVT: 
multivariate generalized Pareto distributions and multivariate max-stable dis- 
tributions are introduced via D-norms. The particular role that copulas play 
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in MEVT is investigated in detail in Chapter 3. D-norms can also be defined 
on functional spaces, as in Section 1.10. This enables a smooth approach to 
functional extreme value theory in Chapter 4, in particular to generalized 
Pareto processes and max-stable processes. 

Further applications of D-norms, such as the max-characteristic function, 
multivariate order statistics or multivariate records are given in Chapter 5. 

Parts of the present text were presented during a Winter Course, organized 
by the CRoNoS COST Action, and at a tutorial preceding the 8th Interna- 
tional Conference of the ERCIM WG on Computational and Methodological 
Statistics (CMStatistics 2015), December 2015, Senate House, University of 
London, UK. The tutorial was mainly directed at PhD students and Early- 
Stage Career Investigators. 

This text was also used as a basis for several courses taught at the Uni- 
versity of Würzburg. The material in Chapter 1 can be used to give an in- 
dependent semester-long course on D-norms. The mathematical prerequisites 
are modest; a basic knowledge of analysis and probability theory, including 
Fubini’s theorem, should be sufficient in most cases. Possible applications of 
the theory of D-norms can be taken from Chapter 5. 

A one-semester course on MEVT can be based on a combination of auxil- 
iary results for D-norms, in particular from the Sections 1.1, 1.2, 1.3, 1.6, 1.7, 
and 1.10, with the introduction to univariate extreme value theory in Sec- 
tion 2.1 and the material on MEVT as developed in Sections 2.2, 2.3, and 3.1. 
An introduction to functional extreme value theory as provided in Sections 4.1 
and 4.2 can be added. Possible applications, such as generalized max-linear 
models or multivariate records and champions, can be taken from Sections 4.3 
and 5.3. 

Views on D-norms from a functional analysis and from a stochastic geom- 
etry perspective are provided in Section 1.11 and Section 1.12 respectively. 
These mathematically more advanced sections underline in particular the aim 
of this book, which is to reveal mathematical structures. It is not a book on 
statistics. 

The author greatly appreciates the constructive feedback given by his stu- 
dents; in particular, Emily Geske and Simon Kolb deserve to be mentioned 
here for their extraordinarily careful reading of the manuscript and their 
many helpful suggestions. The author is grateful to his PhD students Ste- 
fan Aulbach, Timo Fuller, Daniel Hofmann, Martin Hofmann, René Michel, 
Florian Wisheckel, and Maximilian Zott for their cooperation and numerous 
scientific contributions. The collaboration with Gilles Stupfler on the max- 
characteristic function is greatly acknowledged. 

Last, but not least, the author is in particular indebted to Sidney Resnick 
for pushing him very gently but persistently to write this book. 


Würzburg, Germany Michael Falk 
September 10, 2018 
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1 
D-Norms 


This chapter is devoted to the theory of D-norms, a topic that is of unique 
mathematical interest. It is aimed at compiling contemporary knowledge on 
D-norms. For a survey of the various aspects that are dealt with in Chapter 1 
we simply refer the reader to the table of contents of this book. 


1.1 Norms and D-Norms 


We start with a general definition of a norm on the d-dimensional Euclidean 
space R? := {æ = (x1,..., £4): ti ER, 1<i< d}. 


Definition 1.1.1 A function f : R > [0,œ0) is a norm if, for all 
x,y € RI, \ € R, it satisfies 


Condition (1.2) is called homogeneity and condition (1.3) is called triangle 
inequality or A-inequality, for short. 
A norm f : R? —> [0, 00) is typically denoted by 


d 
æl = f(z) weR’. 
Each norm on Rê defines a distance, or metric on R? via 
d 
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2 1 D-Norms 


Well-known examples of norms are the sup-norm 


lal). = max, 


and the L1-norm or Manhattan-norm 
d 
lial, = >> eal, z = (z1,..., z4) ERY, 
i=1 


THE LOGISTIC NORM 


Less obvious is that 


d 1/p 
|x|, := (>: er) , 1I1<p<œ, 


i=1 


actually defines a norm for each p > 1. This is the family of logistic norms. 
The particular case p = 2 is commonly called the Euclidean norm. 

Although condition (1.1) and homogeneity (1.2) are obvious, the proof of 
the corresponding A-inequality is a little challenging. The inequality 


d 1/p d 1/p å 1/p 
(>: [zi tu’) < (> er) + > wr) 
i=1 i=l i=l 


is known as the Minkowski inequality. We do not establish this inequality 
here, as it follows from the fact that each logistic norm ||-||,, is a D-norm; see 
Proposition 1.2.1. 

The next result shows that the sequence of p-norms decreases pointwise as 
p tends to infinity, with its limit being the sup-norm ||-||,,. As a consequence, 
lll can be added to the family of logistic norms, now satisfying pointwise 


lll < lll < Mla 
We see in (1.4) that these inequalities are maintained by the set of D-norms. 
Lemma 1.1.2 We have, for 1< p< q <œ and x € R$, 


© læl, 2 ælg 


(ii) lim æl, = lell: 


1/p 
Proof. (i) This inequality is obvious for q = oo: |la||,, < BA nil?) 


Consider now 1 < p < q < oo and choose x # 0 € R°. Put S := ||ællp 
Then, we have 
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z£ 
=I =7 
Slmt 
and we have to establish m 
zls! 
Slla 
From | 
Ti 
€ [0,1 
= e [0,1] 
and, thus, 
|x| \? |zs|\? 
< 1<i<d 
(£ ~y] e 
we obtain 


Isl. (>: e)" < (>. O z Ti aei 


which is (i). 
(ii) We have, moreover, for x 4 0 € R? and p € [1, 00) 


d i p 1/p 
lele < lel = (X (JELY) lele <2 Han -o lel: 


į=1 


which implies (ii). 


NORMS BY QUADRATIC FORMS 


Let A = (ai;)1<i,j<a be a positive definite d x d-matrix, i.e., the matrix A is 
symmetric, A = AT = (a;i)1<i,j<a, and satisfies 


az’ Ax = 5 Zilijz; > 0, rE R$, xr#0E€ RI. 
1<i,j<d 
Then, 
1 
æla := (aT Ax)? , x € RI, 


defines a norm on R?. With 
10 
a= (94). 


we obtain, for example, ||a||_, = (£? + 23)!/? = ||ælla. 


Conditions (1.1) and (1.2) are obviously satisfied. Write A = At? A12, 
where A'/? is the symmetric root of A. The ordinary Cauchy-Schwarz in- 
equality (aTy)? < (aTx)(yTy), x,y € Rt, implies 
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T 2 1/2,,)T (41/2 2 T T 
(æT Ay)? = (47x) (A"/?y)) < (wT Az) (y Ay). 
The -inequality is a consequence: 


læ +yli = (æ +y)" A(z +y) 
= £t Ax + y Ax + x Ay +y Ay 
< aT Ag + 2(xT Ax)? (yT Ay)? +y'Ay 


= ((æ7Ax)? + (y"y)?) 


DEFINITION OF D-NORMS 


The following result introduces D-norms. 


Lemma 1.1.3 (D-Norms) Let Z = (Zı,..., Za) be a random vector 
(rv), whose components satisfy 


Z >20, AZ ad, 1<i<d. 


Then, 


a r . d 
ell = B(max(inlZ)), eR, 


defines a norm, called a D-norm, and Z is called a generator of this 
D-norm ||:|| p- 


Proof. The homogeneity condition (1.2) is obviously satisfied. Further, we 
have the bounds 


læls = max |z:| = max E (læ:| 24) 


d 
< MEA < | Z. 
< E ( max (inl Z)) < E bs a z) 
d 
=X || E(Z:) =|la|,, 2 eR, 
j=l 


i.e., 
d 
læl < lællp Sllel,, æ € R°. (1.4) 


This implies condition (1.1). The A—inequality is easily seen: 
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=E (sax (zl z:)) +E (nas (i 2:)) 


1<i<d 


= llzllp + Ilyllp- 


BASIC PROPERTIES OF D-NORMS 


Denote by e; = (0,...,0,1,0,...,0) € R? the j-th unit vector in R4,1< j < 
d. Each D-norm satisfies 


lejl = E ( max 642) = PZ) =1; 


where 6;; = 1 if i = j and zero elsewhere, i.e., each D-norm is standardized. 
Each D-norm is monotone as well, i.e., for O < x < y, where this inequality 
is taken componentwise, we have 


[elo = E ( max (2:2)) < £ (ax (u2) = lilin. 


1<i<d 1<i<d 


Note that there are norms that are not monotone: choose, for example, 


=) 


with ô € (—1,0). The matrix A is positive definite, but the norm |||, = 
(at Ax)!/? = (a? + 26a 22 + «2)!/? is not monotone; put, for example, 6 = 
—1/2 and set xı = 1, z2 = 0, yı = 1, and y2 = 1/2. Then, æ < y, but 


ella =1> lyla = V3/2. 


Each D-norm is obviously radially symmetric, i.e., changing the sign of 
arbitrary components of x € R? does not alter the value of ||æ||p. This means 
that the values of a D-norm are completely determined by its values on the 
subset {a € R¢: £ > 0}. The above norm ||-|| ,, for example, is in general not 
radially symmetric. 
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1.2 Examples of D-Norms 


Choose the constant generator Z := (1,1,...,1). Then, 


ley = E ( max (lel 2) ) = E ( max (eD) = lelo 


i.e., the sup-norm is a D-norm. 
Let X > 0 be an rv with E(X) = 1 and put Z := (X,X,...,X). Then, Z 
is a generator of the D-norm 


ell p = B( max (le: Z) = E(amax (le: X) 


max (DEX) = [lalla EX) = [allo (1.5) 


This example shows that the generator of a D-norm is in general not 
uniquely determined; even its distribution is not. 

Now let Z be a random permutation of (d,0,...,0) € R? with equal 
probability 1/d, i.e., 


Zz, = d, wih probability 1/d, eisa 
0, with probability 1 — 1/d, 


and Zı + ---+ Za = d. In what follows, we use 1(Z; = d) to denote the 
indicator function of the event Zj = d, i.e., 1(Z; = d) = 1 if Zj = d and 
1(Z; = d) = 0 elsewhere. The rv Z is the generator of a D-norm: 


d 
=E | max (lei Z) X 1(Z;=4) | =E | $- max (lz: Zi) (Z; =4) 
< m 


d d 
=E 2 lz; d1(Z; =d) | = 2 lz;l dE (1(Z; = d)) 


d d 
= Sole) dP(Z; = d) = $ |z;| = leh 
j=l j= 
i.e., ||-||, is a D-norm as well. 


Inequality (1.4) shows that the sup-norm ||-||,, is the smallest D-norm and 
that ||-||, is the largest D-norm. 
EACH LOGISTIC NORM Is A D-NORM 


Using I'(s) = =f, t lexp(—t) dt, s > 0, we denote the usual Gamma 
function. 
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Proposition 1.2.1 Each logistic norm ||æ||, = (On jai? )*/?, 1< 
p< œ, is a D-norm. For 1 < p < œ a generator is given by 


1/p 1/p 
20) = (2,...,2) = a oe | 


T-pyT0-) 


where X1,...,Xq are independent and identically (iid) standard 
Fréchet-distributed rv, i.e., 


P(X; < x£) =exp(-27"), Co) at el en, 


with E (ee) =P(1-p"}), 1l<i<d. 


From Lemma 1.1.2, we know that ||-||,, po ||-|,, pointwise. We have, 
moreover, I" (1 — p+) >p> P'(1) = 1 and, consequently, we also have point- 
wise convergence almost surely (a.s.): 


ZP) = (aaa) 


x1? X4 
“ATa-p T  mM-r 


Seco (Basag) a.s., 


where the constant Z = (1,...,1) € R? is a generator of the sup-norm ||la- 
Note that, on the other hand, I (1 —p') pi co and, thus, 


ze) — Cage spi O€ R, 


which is not a generator of a D-norm. This is an example where pointwise con- 
vergence of D-norms does not entail convergence of their generators. However, 
the reverse implication is correct, see Corollary 5.1.13. 

Before we prove Proposition 1.2.1, we state a tool that is often used in this 
book. 


Lemma 1.2.2 Let X be an rv with X > 0 a.s. Then, its (possibly 
infinite) expectation can be written as 


[rapa f 1- Fede 


where F is the distribution function (df) that corresponds to X. 
If X <0 a.s., then 


ao ae i ” Rat 


8 1 D-Norms 


Proof. Suppose X > 0 a.s. Using 14(t), we denote the indicator function of a 
set A, i.e., L4(t) = 1 if t € A and zero elsewhere. Fubini’s theorem implies 


E(X) = f x F(dx) 


-[ JP toay(t) at Fae) 
m f > f Loa) (t) F (da) dt 


= E(1po,xy(t)) dt a P(X > t)dt. 
0 0 


If X <0 a.s., then —X > 0 as. and 


E(X) = —E(—X) =~ m P(X >t)dt= f F(t) dt 


by elementary arguments. 


An immediate consequence of the preceding result is, for example, the 
following conclusion: if the rv X satisfies X > 0 a.s. and E(X) = 0, then 
X = 0 a.s. This conclusion is often used in what follows as well. 

Proof (of Proposition 1.2.1). Check that u := E (x) = I'(1—p7) (or see 
equation (2.23)). From Lemma 1.2.2, we obtain 


E (max (\x;| z:)) = 1 P ( max (|x;| Z;) > t) dt 
1<i<d 6 1<i<d 


“fe S 
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a 
The substitution t > t aa |z” ) ” /u implies that the integral above equals 


= 


(Dia jail”) i a 1 — exp (-3) dt 


p P 
z i f p (x1? >t) dt 
|x| 
E GO Z (x) 
= |jæll,- 


THE HÜSLER-REISS D-NORM 


Let the rv X = (X1, ..., Xa) follow a multivariate normal distribution with 
mean vector zero, i.e, E(X:) = 0, 1 < i < d, and covariance matrix 
A S= (Tij)i<ij<a = (E(XiX;))i<i,j<a- Then, exp(X;) follows a log-normal 
distribution with mean exp(o;;/2), 1 < i < d, and thus, 

O71 


Z = (Zi,..., Za) := (exp (xı — m) +++, @XP (xa — m) (1.6) 
is the generator of a D-norm, called a Hüsler-Reiss D-norm. This norm de- 
pends only on the covariance matrix X, and therefore, it is denoted by ||-|R,- 

In the special case where X is a finite dimensional margin of a Brownian 
motion (By)i>0, ie., X = (Bi,,...,Br,), O0 < ti < -+ < ta, we compute the 
bivariate projections 


II(z, lars, = B(max(|2] Zi, ly] Z;)), ty ER, 


in Lemma 1.10.6. For more general computations we refer to Krupskii et al. 
(2018) and the literature cited therein. 

Recall that an arbitrary positive semidefinite d x d-matrix X defines the 
covariance matrix of a multivariate normal distributed rv X = (X1,..., Xa) 
with zero means. With the corresponding Hiisler-Reiss D-norm ||-||yp,., we 
have thus defined a mapping from the set of positive semidefinite dx d-matrices 
into the set of D-norms on R¢. This mapping is, however, not one-to-one: 
Denote by E that dx d-matrix with constant entry one. Then, the two matrices 
X and X + AE, with an arbitrary number A > 0, generate the same Hiisler— 
Reiss D-norm ||: |nr; see Example 1.9.2. 
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GENERATORS WITH UNIQUELY DETERMINED DISTRIBUTION 


We have seen already that neither the generator nor the distribution of a D- 
norm is uniquely determined. Actually, if Z = (Z1,..., Za) is the generator 
of a D-norm, then Zx := (2,X, Z2X,...,ZaX) generates the same D-norm 
if X is an rv with X > 0 and E(X) = 1, which is also independent of Z. This 
is easy to see and closely related to the multiplication of D-norms as defined 
in Section 1.9. 

The distribution of a generator Z = (Z1,..., Za), whose first component is 
the constant one, however, is uniquely determined. This is an obvious conse- 
quence of Lemma 5.1.1. An inspection of its proof shows that this conclusion 
remains valid if Z; = 1 for some i € {1,...,d}, i.e., we have the following 
result. We state it here for the sake of completeness. 


Lemma 1.2.3 Let Z = Agee z = Za) 


be generators of the same D-norm ||-||, on R? with the property that, 


for some index i € {1,...,d}, we have Ze == Ze) Then, the 
distributions of Z and Z® coincide. 


1.3 Takahashi’s Characterizations 


The following result shows that the two extremal D-norms ||-||,, and ||-||, are 
already characterized by their values at just one point. 


Theorem 1.3.1 (Takahashi (1987, 1988)) Let ||-|| be an arbitrary 
D-norm on R°. Then, we have the equivalences 


© Wllp= Wh, = 3y >0E€R*: [yl = lull 


& IHl =le = ltl = 1 


Corollary 1.3.2 For an arbitrary D-norm ||-|| on R¢, we have 


ls 
Ilo = tte = |1llb= 
1? : 


Proof. To prove Theorem 1.3.1 we only have to show the implication “<=”. 
Let (21,...,Zq) be a generator of ||-|| 5. 
(i) Suppose we have ||y||,5 = ||y||, for some y > 0 € R4, i.e 


d d 
B( max, (ui) -u= Ynez) =E (Yuz). 
i=1 i=1 
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This entails 


=i 


d d 
E (>: nzi) -E (ps (iz) = (Souza — max (yiZs) 
S————— 


>0 


> Suz i — max, x (Yi Zi) =0 a.s. 


> Suz <= max, x (YiZi) a.s. 


Recall that y; > 0 for all i. Hence, Z; > 0 for some i € {1,... 


Z; =0 for all j Ai. Thus, we have, for arbitrary x > 0 € R, 


d 
5 iZi = max (ziZi) a.s. 
a 1<i<d 
d 
> E bs az) =E ( pgs (e12) 
> ely =ll2llp- 


(ii) We have the following list of conclusions: 


>0 
= max Z,—2Z;=0 a.s., 1<j<d, 
1<i<d 
Zi Sa Saam Ld max Z; a.s. 
1<i<d 
>E (p (Jæ: z3) =E ( max (|x| z1)) 
1<i<d 1<i<d 
= E(ljæll 21) 
= |x|, F(Z) 


)=0 


,d} implies 
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SEQUENCES OF D-NORMS 
Theorem 1.3.1 can easily be generalized to sequences of D-norms. 


Theorem 1.3.3 Let ||:||p, n €N, be a sequence of D-norms on R. 


(i) VeeER®: |ællp, > leh = 3y > 0: lylo, > lull 


2 a 


nm noo 


ai d.: 
(ü) Yæ ERA: |ællp, > læls = ltl, >. 1 


Corollary 1.3.2 carries over. 


Proof. Let (a, srid zy) be a generator of ||-|| >. Again, we only need to 


show the implication “=”. 
(i) We suppose ||y|], — ||¥llp,, —>n> 0 for some y > 0 € R°. With the 


notation Mj := {yz = MaXxı1<i<d vz} and 1m, denoting the indicator 
function of the event Mj, we get for every j = 1,...,d 


d 
z _ (n) _ (n) 
lul - lylo, = E | $u: — max v2” 


>0 


d 
= (n) 
(Eemer) 


Bl Suz ae 


iZ 


d 
=Y nn Ai) a 


=1 
iAj 


vo. 


since the left-hand side of this equation converges to zero by assumption: 
llull — llull p, n=% 0 and the right-hand side is non-negative. As y; > 0 for 
all i = 1,...,d, we have 


E (Zu) >a 0 (1.7) 


for all i Æ j. Choose an arbitrary 2 € R¢. From inequality (1.4) we know that 
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0 < lel, — Welly, 


d 
=E |Y leZ — maz ol Z 
i=1 ~~ 


>0 
d f/d 
<E 5 (Srazi — max |z;| Z ) lm, 
j=1 \i=1 
- Z (n) (n) 
-Ze ((L ez = aa a ) 


d d 
= 5 5 jzi| E (a ie) nso 0, 


i=1 
ižj by (1.7) 
a yo 


n— co 


which implies 
lælp, neo |l#ll,, 2 €R*. 


(ii) We use inequality (1.4) and obtain 


0 < |lallp,, = lell 


Si<d < 
= |||, G (sna a] = 1) 
1<i<d 


CHARACTERIZATIONS BY BIVARIATE PROJECTIONS 


Let ||:||p be an arbitrary D-norm on R? with generator Z = (Z1,..., Za). 
Then, || (2, vllo, := E(max(|2| Zi, y| Z;) = ||ve; + yejzl||p defines a D-norm 
on R?, for 1 < i < j < d, with generator (Z;, Z;). Recall that e; denotes the 
i-th unit vector in R4, 1 <i < d. 

The preceding results provide characterizations of the extremal D-norms 
Illo and ||-||, by their values at a single point in R?. Both can also be char- 
acterized by the corresponding families of bivariate projections {||e; + e;||p : 
1<i< j <d}. This is the message of the subsequent results. The convergence 
of norms is meant pointwise as in Theorem 1.3.3. 
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Theorem 1.3.4 Let ||||p,; n € N, be a sequence of D-norms on R°. 
We have the equivalences 


© Wp, eo ll = Y1<1<3 <d: le: + eslo, 2% 


(ii) llos 2, He = 


i € {1,..., d} Vj Ai: |e: +e;llp, 


= E (Z{® + Z™) — E (max(z{”, z) 


(n) (n) (n) 7(n) 
> E (e + Z; = max(Z; 5 Z; )) l {z mai cia 2»}) 


= (n) 
= E (z lzm maxisrsa g) >0. 


Therefore, E 271 >n 0, which is (1.7). We can 


ge =Maxi<k<d Ze 
repeat the steps of the preceding proof and get the desired assertion. 
(ii) For our given value of i, we have 


j=l 
= (n)_gln)\ _ y(n) 
= Y E| (max (z; Z; ) — Z; ) lf 76) maxiseza a3) 
j= 
d 
<E (max (z zm) gm) 
j=1 
= Ð (le+e;ln,-1) >n» 0, 
1<j<d, jżi 


which proves the assertion according to part (ii) of Theorem 1.3.3. 


The following consequence of the preceding theorem is obvious by putting 
Hlo, = Wrllb- 
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Corollary 1.3.5 Let ||-||,, be an arbitrary D-norm on R°. We have the 
characterizations: 


@ lHo =I <> Visi<j<d:ller+ejllp =2=llec+egll,, 
(@) lllo =Wlleo = 31€ {1,-.-,d}VI AA: lei + Elly =1. 


1.4 Convexity of the Set of D-Norms 
A convex combination of two D-norms is again a D-norm. 


Proposition 1.4.1 The set of D-norms on R? is convex, i.e., if \|-||p, 
and ||-||p, are D-norms, then 


II:llnp, +a—» D2 = Illo, EUA) Illo, 


is a D-norm as well for each à € [0,1]. 


Take, for example, a convex combination of ||-||,, and ||-||,: 


d 
lælo, = Alleles + 1 — A) lell = A max esl + (1 — A) 3 lz:|. (1.8) 
This is the Marshall-Olkin D-norm with parameter A € [0, 1]. 


Proof (of Proposition 1.4.1). Let € be a rv with P(E = 1) = à= 1-— P(€ = 2) 
that is independent of Z and Z®), where Z™ and Z) are generators of 
Illo, and |ll|p,- Then, Z := ZS) is a generator of II-llxp,+a—»)p.> as we 
have, for x > 0 € RI, 


2 
E max PA =E max MASAT =j} 
1<i<d a — i =j 
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By putting x; = 1 and z; = 0 for j 4 i, we obtain in particular Æ Ga =; 


1 <i < d. This completes the proof. 


A BAYESIAN TYPE OF D-NORM 


The preceding convexity of the set of D-norms can be viewed as a special case 
of a Bayesian type of D-norm, as illustrated by the following example. 


Consider the family {i-l p> 1} of logistic D-norms as defined in (1.1). 
Let f be a probability density on [1, 00), i.e., f > 0 and ff? f(p) dp = 1. Then, 


lll, = f lel, fd, æ ERË, 


defines a D-norm on R¢. This can easily be seen as follows. Let X be an rv on 
[1,œ0) with this probability density f(-), and suppose that X is independent 
from each generator Z) of Illp P 2 1. Then, 

Z:= Z%) 


generates the D-norm ||-|| ¢: 


and 


If we take, for instance, the Pareto density f,(p) := Ap~ “+, p > 1, with 
parameter À > 0, we obtain 


oo p 1/p 
læll;, = f (>. ar) Ap +”) dp, x eR’. 


i=l 
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1.5 When Is an Arbitrary Norm a D-Norm? 


The obvious question When is an arbitrary norm ||-|| a D-norm? is answered 
by Theorem 2.3.3: if and only if (iff) the norm ||-|| is standardized, i.e., ||e,|| = 
1,1<:i< d, and 

G(x) = exp (— ||(min(z;, 0) ||) = exp(— ||min(a, 0)||), 
x = (%1,...,2a) € RÊ, defines a df on Rê. If G is a df, then each univariate 


margin is the standard negative exponential df exp(x), x < 0. 
The function G defines a df iff it is A-monotone: For a < b € R? 


aa yl Sama palm... pam) > o. 
me{0,1}4 

The remaining properties that G has to satisfy to be a multivariate df, such 

as its continuity from the right, are automatically satisfied by its definition; 


see, e.g., Reiss (1989, (2.2.19)). Hofmann (2009) established the following 
characterization. 


Theorem 1.5.1 (Hofmann, 2009) Let ||-|| be an arbitrary norm on 
Rt. Then, the function G(x) = exp(—||x||), x < 0 € R, defines a 
multivariate df iff the norm satisfies 


ye (—1)4t1-2aca™s 


m€{0,1}¢: m;=1, ieK 
Ree eee So (1.9) 


for every K C {1,...,d}, K A {1,...,d}, and —œ0 < aj <b; <0, 1< 
j<d. 


An extension of the previous characterization, with the norm ||- || replaced 
by a homogeneous function, was established by Ressel (2013, Theorem 6). 


THE BIVARIATE CASE 


Putting K = {1} and K = {2}, condition (1.9) reduces in the case of d = 2 
to 
|| (b1, 62) |] < min (|| (b1, a2)|| , || (a1, b2)II), a< b< o0, 


which, in turn, is equivalent to 
bl] < llall, a<b<o, 
i.e., the monotonicity of ||-||. With K = 0, inequality (1.9) becomes 
I| (a1, a2)l| + ||(b1, b2)l| < I|(a1, b2)|| + ||@1, a2) - 
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But this is true for every norm on R?, as we show next. Suppose that 
a # b. Put 


— aıb2 = by be fae biaz = bib 
5 aiaz — by be’ O aiaz — bıb2 
_ 4142 — aıb2 _ 4142 — biaz 
a1đ2 — bibo D a1đ2 — bibo ` 


Then, œ, 8,y,6>0,a+y=1=6+46, 
a = 7(b1,a2) + 6(a1,b2), b = a(bı,a2) + B(a1, b2) 
and, hence, the triangle inequality implies 


llall + b|| = |y (b1, a2) + 6(a1, b2)l| + ||a(b1, a2) + 2(a1, bə)|| 
< q ||(b1, a2)|| + 8 || (a1, b2)|] + a ||(b1, a2)|| + £ || (a1, b2)ll 
= ||(b1,a2)|| + I| (a1, be) IL. 


We thus obtain from Theorem 1.5.1 the following characterization in the bi- 
variate case. 


Lemma 1.5.2 Take an arbitrary norm |\-|| on R?. Then, 


G(x) =exp(- |æ), æ <0, 


defines a df in R? iff the norm ||-|| is monotone. 


The following lemma entails that in the bivariate case, G(x) = exp(— ||x||), 
æ < 0 € R?, defines a df with standard negative exponential margins iff the 
norm ||| satisfies |x|, < |x|] < [xlly, æ > 0. 


Lemma 1.5.3 Let ||-|| be a norm on R¢. If ||-|| is monotone and stan- 
dardized, then we have, for0 < x € RÌ, 


Ixl < ll] < lll - 


For d = 2, the converse statement is also true. 


The following characterization of a D-norm on R? is a consequence of the 
previous results. 


Corollary 1.5.4 A radially symmetric norm ||-|| on R? is a D-norm 
iff, forO< x € R?, 


Ixl < [Pell < lll - 
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The preceding equivalence in R? is not true for a general dimension d. 
Proof (of Lemma 1.5.3). Let 0 < x = (x1,...,2a) € R. Since the norm is 
standardized, we have by the triangle inequality 

I|(w1,.-.5@a)|] < ||(@1,0,.-.,0)|] + -++ + 1(0,.-.,0, ra) 
SO te taS ||(£1; -< £a)lla 
Furthermore, we obtain from the monotonicity of ||- || 
KESTE <- za )|| 2 I|(0, .. .,0,2;,0. vg) = Xj llei| = ti, i< d, 
and, thus, ||a|| > max(zxı,..., £a) = ||x||,,. Overall, we have ||x||,, < ||x|| < 


Ixl- 
Now let d = 2 and suppose that the norm satisfies ||x||,, < |||] < ||x|], 
for 0 < x. Then, we have 


1 = fjell < lle < Ileal; =1 


and, thus, the norm is standardized. 

Take a = (a1,a2), b = (b1,b2) € R? with O < a < b and O < b. The 
condition ||x||,, < ||x|| implies b; < max(bı, b2) = ||b||,, < ||b]| for i = 1,2. 
From the triangle inequality we obtain 


bi — a 
l|(a1,b2)|] = |= (0, b2) + 4 (b1, b2) 
by by 
bi -—a a 
< +— ||(0, b2)|| += II(b1, b2)I| 
b Vm by 
=b2<||b]| 
b-a a 
< (He + E) erb) = b 
1 1 
and 
llall = Ila, a2)|| 
b a 
=| 2  (an,0) + (a,b) | 
2 
bs — a a 
<== (a. Ol] + arb) 
2 —— —L— 


=aı bı <||b]| <||b||, see above 


bz — ag a2 
< — } ||b|| = [bl] . 
< (222+ 2) Ibl = 1b 


Therefore, the norm is monotone. 


1.6 The Dual D-Norm Function 


A D-norm is defined by ||æ||p = E (maxı<i<al|x:i| Z;)). Replacing max by min 
in this definition yields the dual D-norm function. It plays a particular role 
when computing survival probabilities in multivariate extreme-value theory 
(MEVT). In this section, we collect elementary properties. 


20 1 D-Norms 


There is a close relation between the maximum and the minimum of 
real numbers, which in particular leads to the inclusion—-exclusion principle 
in Corollary 1.6.2. In what follows, we denote the number of elements in a set 
T by |T]. 

REPRESENTATION OF MAXIMA BY MINIMA AND VICE VERSA 


The following lemma can easily be proved by induction using the equation 


min(max(aı,..., an), @n41) 


= max(min(a1, @n+41),---,min(@n,@n41)). 


The case n = 2 is obvious: 


max(aı, a2) = a, + a2 — min(ay, a2). 


Lemma 1.6.1 Arbitrary numbers a1,...,an E R satisfy the equations 
max(qay,... 
min(a1,... X 
AECI mny 
By choosing a, = --: =a, = 1, the preceding result implies in particular 
i= +. pe: (1.10) 
OAT C{Il,...,n} 


The inclusion-exclusion principle turns out to be a straightforward conse- 
quence of Lemma 1.6.1. 


Corollary 1.6.2 (Inclusion—Exclusion Principle) Let <Aj,...,An 
be measurable subsets of a probability space (2, A, P). Then, 


Us). orl) 


OATES enn} i€T 


P (À a) =e AER (U a) i 


i=1 OAT C{Il,...,n} ieT 
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Proof. Choose w € Q and set a; := a;(w) := 1la,(w), 1 < i < n. From 
Lemma 1.6.1, we obtain 


= _4\ITl-l m; 
max tae) = SD CDT min 14w). 
Pa OATC{I,...,n} 


Taking expectations on both sides yields 


= —1)ITI-1 i 
P(mmta)= D DPE (mip ta). 
= OAT C{l,...,n} 


by using linearity of expectation. The first equation in Corollary 1.6.2 is now 
a consequence of the equalities 


i ifw € UP, Ai, 
0, ifw ZU, Aj, 


1l<i<n 


max 1l4,(w) = 


1, ifweé€NierAi, 
0, ifw ¢MerAi, 


ier 


min 14,(w) = 


yielding 
E ( max 1a.) =P (U4) , E (pta) =P (qa) ‘ 


Repeating the preceding arguments implies the second equation in Corol- 
lary 1.6.2 as well. oO 


Corollary 1.6.3 If Z), Z) generate the same D-norm, then, for 
each « € R¢, 


> ( min, (Iz: z®)) a ( min, (Ix 2) Í 


Proof. Corollary 1.6.3 can be seen as follows: 


i (i, (æ: z®)) 


-E y (1/7 max (lz: z) 
OAT C{1,...,d} 


a 2 (-1)7™1B (max (el 29) ) 


OAT C{1,...,d} 
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= >» JDP sle 


OAT C{I,...,d} jET D 
= 5 (1) TI-E (as (12®)) 
OATC{I,...,d} JET 
=E| Ð (=1)T-max (lz: 2?) 
0ATC{L,...,d} IS 


=E ( min (lz: z®)) . 
1<i<d 


DEFINITION OF THE DUAL D-NORM FUNCTION 


Let ||-||) be an arbitrary D-norm on R? with arbitrary generator Z = 
(Z1,..., Za). Put 


Rep :=E ( min (|x| z:)) ; x ERI, (1.11) 
1<i<d 


called the dual D-norm function corresponding to ||-|| 5. By Corollary 1.6.3, 
it is independent of the particular generator Z, but the mapping 


lllo > 8: Wd 


is not one-to-one. Consider, for example, the generator Z = (Z1, Z2, Zs), 
which attains each of the two values (2,0,1), (0,2,1) with a probability 1/2. 
As a consequence, mini<;<3 Zi = 0. The corresponding D-norm ||-||p has a 
dual function }: p = 0, but ||-||, does not coincide with ||-||,, which also has 
a dual function 2-2, = 0: 


Il (2, 1,3)|lp = E(max(2Z1, Z2,3Z3)) = 7/2 
# I|(2, 1,3)||, = 6. 


Clearly, 
l- p =0 (1.12) 


is the least dual D-norm function, corresponding, for example, to ||-||,, = Illl 
if d > 2, whereas 


Rep = min |zr;| = Va No, x eR’, (1.13) 
1<i<d 
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is the largest dual D-norm function, corresponding to ||-||,) = ||-||,,. This 
follows from the inequality 


n 
1<i<d 


[zp] = E(|er| Ze) = E ( min (|z;| z,)) , tered (1.14) 


Thus, for an arbitrary dual D-norm function and d > 2, we have the bounds 
O= RW LU- UD KV: oo. 
The dual D-norm function is obviously homogeneous of order one, i.e., 
VAL UD = |A| Nw, AER, eeER’. 


Choose an arbitrary D-norm on Rê with generator Z = (Z),..., Za), and 
denote by ||-||5,, that D-norm on RITI, which is generated by Zr := (Z;)ier, 
OAT C {1,...,d}. Then, the mapping 


II lp > Ror: OAT C {1,...,d}} 
is one-to-one according to Lemma 1.6.1. 


Example 1.6.4 (Marshall—Olkin Model) The Marshall—Olkin D- 
norm ||-||p, in (1.8) with parameter A € [0,1] has for d > 2 the dual 
D-norm function 


= : = d 
a eels T= (treta) ERS 


This is an immediate consequence of the fact that ||-||p, has the 
generator ZS) as in the proof of Proposition 1.4.1, together with 
VE Noo = mini<i<a|xi| and Væ V = 0. 


Example 1.6.5 (Weibull Model) We can define a generator Z = 
(Z1,...,Za) by taking independent and identically Weibull distributed 
tv Z1,...,Za, ie., P(Z, > t) = exp(—t?), t > 0, p > 1, and putting 
Zi := Ž;ı/T(1 + p7"). It is easy to see that the corresponding dual 
D-norm function for x € R¢, x; Æ 0,i=1,...,d, is given by 
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Hence, according to Lemma 1.6.1, the corresponding D-norm for 
those æ, whose components are all different from zero, is given by 


a syne o 
ltlw,= X Cd [G/edierl, 


OAT C{1,...,d} 


1.7 Normed Generators Theorem 


In this section, we establish in Theorem 1.7.1 the surprising fact that for any 
D-norm ||:||p on R? and an arbitrary norm ||-|| on R? there exists a generator 
Z of ||-||p with the additional property ||Z|| = const. The distribution of this 
generator is uniquely determined, and enables in particular the metrization 
of the set of D-norms in Section 1.8. 

The results in this section actually laid the groundwork for MEVT, as in 
Balkema and Resnick (1977); de Haan and Resnick (1977), and Vatan (1985). 


EXISTENCE OF NORMED GENERATORS 


The main Theorem 1.7.1 follows from a sequence of auxiliary results, which 
we establish below. 


Theorem 1.7.1 (Normed Generators) Let ||-|| be an arbitrary 
norm on Rt. For any D-norm ||-\|p on R?, there exists a genera- 
tor Z with the additional property ||Z|| = const. The distribution of 
this generator is uniquely determined. 


Corollary 1.7.2 For any D-norm ||:||p on R, there exist generators 
Z, Z) with the property So z® = d and maxı<i<a z® = 
const = ||(1,..., Dl p- 


Proof. Choose ||-|| = ||-||, in Theorem 1.7.1. Then, 
í (1) 
t= zol =y Z 

cons | To 


Taking expectations on both sides yields 


const = 5 E (2) =d. 


i=1 


Choose ||-|| = ||-||,, for the second assertion. 
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Example 1.7.3 Put Z® := (1,...,1) and Z@) := (X,...,X), where 
X > 0 is an rv with E(X) = 1. Both generate the D-norm ||-||,,, but 
only Z® satisfies || Z|], = a. 


Example 1.7.4 Let Vi,...,Vqaq be independent and identically gamma 
distributed rvs with a density yq(x) := £°~!exp(—2)/I'(a), x > 0, 
a > 0. Then, the rv Z € R? with components 


: Vi 
Zi := , 
Vi +e + Va 


follows a symmetric Dirichlet distribution Dir(a) on the closed simplex 
a= {u> 0ER: D u; = 1}; see Ng et al. (2011, Theorem 2.1). 
We obviously have E(Z;) = 1/d, and thus 


Z :=dŽ (1.15) 


is a generator of a D-norm |||| pia) on R?, which we call the Dirichlet 
D-norm with parameter a. We have in particular ||Z||, = d. 
d 
It is well known that, for a general a > 0, the rv (vi/ y V;) - 
and the sum Sa V; are independent; see, for example, the proof of 
Theorem 2.1 in Ng et al. (2011). Since E(Vi +---+Vu) = da, we obtain, 
for x = (%1,...,24) € RF, 


(1:129) 


Ha e( 


maxi<i<a (|x| w) 
( Vite +Va 


max 
1<i<d 


maxy<i<a (|x| = 
Vee 


(suax,(lesl¥0) 


Therefore, a generator of ||-|| pa) is also given by a~'(Vi,..., Va). 


In view of the preceding results, one may ask: given a generator Z of 
a D-norm ||-||,, does there always exist a norm ||-|| with the property that 
||Z|| = const? The answer is “no.” Take, for example, an rv X that follows 
the standard exponential distribution on R. Then, Z := (X,...,X) € Ri 
generates the sup-norm. ||-||_,. Suppose that there exists a norm ||-|| with the 
property ||Z|| = const. This would imply that ||Z|| = X ||(1,...,1)|] = const 
a.s., which is obviously not true. 
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THE DERIVATION OF THEOREM 1.7.1 


The derivation of Theorem 1.7.1 is achieved in a sequence of single steps. 
Throughout the rest of this section, let ||-|| be an arbitrary norm on 
R? and let ||-\|, be a D-norm on R? with an arbitrary generator Z = 
(Z1, eee sfa): Put 

Se:= {s> 0ER?: Isl =c}, 
where c := E(||Z||). First, we show that c is a finite positive number. The 
triangle inequality implies 


d 
X Ziĉi 
i=l 


Iz] = 


d d 
< X` Ziel] = >> Zille: 
i=1 i=1 


and, thus, 
d 


EZI SDA Z;) lleill = le < 00. 


We have, moreover, 
= E(|| Z|) > 0. 
Otherwise, F(||Z||) = 0 would imply ||Z|| = 0 a.s. as the integrand is non- 
negative, and thus, Z = 0 a.s. But this clearly contradicts the condition 
E(Zi)=1,1<i<d. 
The o-field of the Borel subsets of Se is given by 


Bs, := {BN Se: B € BY}, 


where B@ denotes the common Borel o-field in R¢. 


INTRODUCING THE ANGULAR MEASURE 


Put E := (0,00) \ {0} c R? and AB := {\b: be B} for an arbitrary set 
BcEandX> 0. 


Lemma 1.7.5 For A € Bs., put 


lE (Ir,.a(Z) IZI), 


where R,-A:= {àa : A >0,a € A}. Then, B(-) is a probability measure 
on Se, equipped with the o-field Bg,. It is commonly called angular 
measure. 


Proof. The set function ® is obviously ne with 


P(S.) = ŽE (In,-s.(2) IIZI) = SE (1e(Z) IZI) 


= TEZ #0) ||Z||) = ~E(\\ZII) = 
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It remains to show the o-additivity of &. Let A;, i € N be a sequence of 
pairwise disjoint sets in Bs, and put Bn := UP, Ai, n € N, Bo := UienAi. 
Then, > la, = 18,„, n € N, is a monotone increasing sequence of functions 
on Se with limnen lg, = 1g- The monotone convergence theorem implies 


c®(Bx) = E (1r,.8,,(Z) ||Z]|) 
= p ( (tip 12,8, (2)) II) 
= lim E (1r,-B, (2) lll) 
= Tim E (lup ,n,-a (2) |Z). 


Note that the sets Ry - A1, R4 - Ag,... are pairwise disjoint as well: suppose 
that Aja; = Aza; for some 1, A2 > 0 and a; € Aj, aj € Aj. Then, 


and, thus, 
2 2 
c= lail = Ž la;l = Že 


i.e., Ay = Ag and, hence, a; = aj, i.e., i = j. We therefore obtain 
cP(Bao) = lim E (5 Ir, -a,( w) iz) 
= im > E (1r,.a,(Z) IZII) 
i=1 


= uoa &(A;) = c` S(A 


iEN 


i.e., ® is a probability measure on Bs.. 


Lemma 1.7.6 The angular measure P in Lemma 1.7.5 satisfies, for 
EEC ee Td) >O0eER?, 


Se 1<j<d 


max (2383) (ds) = E ( max,(052Z,)) = lelin. 


Proof. For a fixed x > 0 € RÎ, the function f : Se —> [0,00), defined by 
f(s) := maxı<j<a(zjsj), is continuous and non-negative. It is in particular 
Borel-measurable, and thus, there is a sequence of elementary functions 
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In = >»: Qin lAn» neéN, 


with A;n € Bs, and ain > 0,1 <i < m(n), n € N, such that f(s) tren f(s), 
SES. 
By applying the monotone convergence theorem twice, we obtain 


max (x;s;) &(ds) =f fd 


Se 1<j<d 


= lim 1 l fn UP 
m(n) 
ade 
m(n) 


en odd, 
= lim z >, Qin E (IZI IR} -Ain (Z)) 


= lim lE IZI XO ainle, Ain (Z) 


i=l 


m(n) 
1 Zc 
= lim zE IZI XO ainle, sin (Gr) 


i=l 


= (121 (Tz) 


=E ene) = leli- 


E 
The following consequence of the two preceding auxiliary results is obvi- 
ous. 


Corollary 1.7.7 Let Y be an rv with the distribution ®. Then, Y gen- 


erates the D-norm ||:||p and satisfies ||Y || = c a.s. 


The preceding result ensures the existence of a normed generator in The- 
orem 1.7.1. 


THE DISTRIBUTION OF A NORMED GENERATOR IS UNIQUE 


Next, we establish the fact that the distribution of a normed generator in 
Theorem 1.7.1 is uniquely determined. To achieve this, the following lemma 
is helpful. 
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Lemma 1.7.8 There is a unique measure v on the Borel o-field of 
E = (0, 00)*\ {0} with 


v((A, 00) - A) ; A>0, A EBs.. (1.16) 


Proof. Use polar coordinates to identify the set E with (0, o0) -Se and identify 
v in these coordinates with the product measure u x ® on (0,00) x Se, where 
the measure u on (0,00) is defined by p((A,0oo)) = 1/A, A > 0. Precisely, 
define the one-to-one function T : (0,00) x Se > E by 


T(A, a) := Aa 
and put for an arbitrary Borel subset B of E 
v(B) := (u x )(T~'(B)). (1.17) 


Then, v is a measure on the Borel o-field of E with 


Since the set M := {(A,co): A: A >0,A € Bs.} generates the Borel o-field 
of E (use the function T to prove this) and M is stable with respect to 
intersection, i.e., if Mı, M2 E€ M, then Mı N Mz E€ M (check this as well), 
the measure v is uniquely determined by the property (1.16). But this is the 
assertion. o 


Lemma 1.7.9 The measure v from Lemma 1.7.8 satisfies 


1 
v ( [0, æ] 2>OeR?. 
Xp 


= 1 I S 
Se J{A>0: Ase} 
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= pA > 0: As; > x; for some 1 < i < d) &(ds) 
Se 


: Ti 
= f w(aro: Aem, (=)) (ds) 


1 
=| ——— 64 
L min <i<a(2i/$i) ee 


i 1 
oC ee 
S. 1Si<d Ti Tilp 


Corollary 1.7.10 The distribution of a generator Z of ||||p with 


I| =c is uniquely determined. 


Proof. Let Z), Z) be two generators of ||-||) with ||Z|| = ||Z©|| = c. 


For A € Bs, and i = 1, 2, put 
) =£ (tra (Z°)). 


2a) Le (Inca (Z0) fa" 
Then, we obtain 
(4) = E(14(2)) =P(2 €A), 
i.e., ® is the distribution of Z, i = 1,2. 
As the measure v defined in Lemma 1.7.8 is, according to Lemma 1.7.9, 


uniquely determined by the D-norm ||-||,,, we obtain 


P (20 € A) = (A) = v((1,00) - A) = (A) = P (2° é A) , 


which completes the proof. 


THE EXPECTED NORM OF ANY GENERATOR IS A CONSTANT 


The preceding results comprise the Normed Generators Theorem 1.7.1, but 
they yield the following consequence as well. 


Corollary 1.7.11 Let ||-||, be an arbitrary D-norm and let ||-|| be an 
arbitrary norm on R¢. Then, there is a constant const > 0 such that, 
for every generator Z of ||-||p, 


E(||Z||) = const. 


1.7 Normed Generators Theorem 31 


Proof. Let Z), Z?) be two generators of ||-||,. For i = 1,2, put c; := 
E(||Z||), Si = {8s >0€ER*: ||s||=c:} and S; as in Lemma 1.7.5. We 
have S2 = (c2/c1)S1. Since the measure v defined in Lemma 1.7.8 depends 
only on ||:||p according to Lemma 1.7.9, we obtain the equations 


Ba(S2) = v((1, 00) j S2) 


But 1 = 1 (S1) = (S2) and, thus, c1/c2 = 1, which completes the proof. 


For example, for ||-||p = ||-||,, and an arbitrary norm ||-|| on R?, we obtain 
that each generator Z of ||-||,, satisfies 


EZD = 0,- DI. 


This is immediate from the fact that, in particular, Z = (1,...,1) € R 
generates ||-||,, 
An arbitrary generator Z of ||-|| 5 = ||-||, satisfies 


E(||41l) = Ye. 


This follows from the fact that a random permutation of (d,0,...,0) € R? 
with equal probability 1/d generates the D-norm ||-||,. In this case, we have 


d 
E(||Z\|) -$ lel} = X- |le:l. 
i=l 


EXTENDING THE NORMED GENERATORS THEOREM 


The Normed Generators Theorem 1.7.1 can be extended significantly by re- 
placing the norm ||-|| with a general radial function R : [0,00)4 + [0,00) as 
follows. 


Definition 1.7.12 A Borel subset S of (0,00)? is a complete angular 
set if, for every x € [0,c0)4, x #4 O € Rf, there exists a uniquely 


determined vector s € S and a uniquely determined number r > 0 such 
that x = rs. 


For æ = rs with s € R? and r > 0, put 


R(x) :=r 
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and R(0) = 0. Note that the radial function R is homogeneous of order one, 
i.e., R(Ax) = AR(x), A> 0, x € [0, 00)%. If æ = rs with s € S, then Ax = Ars 
and, thus, 

R(x) = Ar = AR(a). 


Repeating the arguments in the derivation of Theorem 1.7.1, the conclusion 
in this result can be generalized as follows. 


Theorem 1.7.13 (Extended Normed Generators Theorem) 
Let S be a complete angular set in RÌ with a corresponding Borel- 
measurable radial function R. For every D-norm ||-|| on R%, there 


exists a generator Z such that R(Z) = const. The distribution of this 
generator is uniquely determined. 


The above result is actually a generalization of Theorem 1.7.1, since not 
every complete angular set S can be generated by a norm ||-|| through S = 
{s €[0,0o)*: ||s|| = c} with some c > 0. For example, put 


S:= {(u,l—u?): we [0,1]}, 


which is a complete angular set in R?. Suppose there is a norm ||-|| on R? such 
that 
S = {(s1, 82) € [0, 1] : |](s1, 52) ll = c} 


for some c > 0. Note that (1,0) and (0,1) are both elements of S, and thus, 
by the triangle inequality and the homogeneity of order one being satisfied by 
every norm, we have for every v € [0,1] 


I|(v, 1 — v) = |e, 0) + 1 — 2), 1)I] 
< vf, 0) + G — v) 10, DI 


=ve+(l—-vj)c=c. 
Choose v € (0,1). Then, there exists À > 1 and u € (0,1) such that 
(v,1—v) =X(u,1—u?). 
As a consequence, we obtain 
II(v, 1 — o)l =Al|(u,1—u?)|] = Ac > e> |[(v,1— v), 


which is a contradiction. 
An example of an unbounded complete angular set in R? is 


S := {(u,1/u) : u > 0}U {(0, 1), (1,0)}. 


Angular sets that are not necessarily complete are introduced in Defini- 
tion 1.11.2. 
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1.8 Metrization of the Space of D-Norms 


Denote by Zj., the set of all generators of a given D-norm ||-||p on Ri. 
Theorem 1.7.1 and Corollary 1.7.2 imply the following result. 


Lemma 1.8.1 Each set Z|.) contains a generator Z with the addi- 
tional property ||Z||, =d. The distribution of this Z is uniquely deter- 


mined. 


Let P be the set of all probability measures on the simplex Sq := {£ > 
0ER: ||æ||ı =d}. Using the preceding lemma, we can identify the set D of 
D-norms on R¢ with the subset Pp of those probability distributions P € P 
that satisfy the additional condition Ssa zi P(dæ)=1, i =1,...,d. 


INTRODUCING THE WASSERSTEIN METRIC 


Denote by dw (P,Q) the Wasserstein metric between two probability distri- 
butions on Sy, i.e., 


:= inf{ E (|X — Y||,) : X has distribution P, Y has distribution Q}. 


As S4, equipped with an arbitrary norm ||-||, is a complete separable metric 
space, the metric space (P, dw) is complete and separable as well; see, for 
example, Bolley (2008). 


Lemma 1.8.2 The subspace (Pp, dw) of (P, dw) is also separable and 


complete. 


Proof. It is sufficient to show that (Pp, dw) is a separable and closed subspace 
of (P, dw). We start by showing that it is closed. Let P,, n € N be a sequence 
of probability measures in Pp that converges to P € P with respect to dw. 
We show that P € Pp. Let the rv X have distribution P, and let X have 
distribution Pa, n € N. Then, 


5 OENES 


i=l 
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= E (|x - x” 


k nEeN. 


As a consequence, we obtain 


d 


x 


i=l 


i x; P(dx) — i < dw(P, Pa) >n>o 0 
Sa 


and, thus, P € Pp. 

The separability of Pp can be seen as follows. Let P be a countable 
and dense subset of P. Identify each distribution P in P with an rv Y = 
(Yi, -.., Ya) on Sa which follows this distribution P, i.e., each component Y; 
is non-negative, and we have Yı + --- + Ya = d. Without loss of generality 
(wlog), we can assume that E(Y;) > 0 for each component. This can be seen 
as follows. Let T C {1,...,d} be the set of those indices i with E(Y;) = 0. 
Suppose that T 4 Ø. As Y; > 0, this implies Y; = 0 for i € T. For n € N, put 


¥ 2a a a) Yi PTER 
nT if tE T. 
Then, Y, Y{® = d and, with Y ™® := Cae ae i), 


J-J Eterna 


ieT igT 


e(r- 


Therefore, the sequence Y ™, n € N, approximates Y arbitrarily closely. We 
substitute Y € P by the sequence fy, nE N} and, as P is countable, 
we can assume wlog that each component Y; of each Y € P has positive 
expectation. 

Finally, put Z = Y /E(Y ). This yields a countable subset of Pp, which is 
dense. 


We can now define the distance between two D-norms ||-||p, and ||-|| p, on 
R? by 
dw (Ih-llp, > Ill.) 


:= inf {E (2 — zel) : Z® generates ||-||p, , |Z® 


=d,i=1,2}. 
1 


The space D of D-norms on R, equipped with the distance dw, is, according 
to Lemma 1.8.2, a complete and separable metric space. 


CONVERGENCE OF D-NORMS AND WEAK CONVERGENCE OF 
GENERATORS 


For the rest of this section, we restrict ourselves to generators Z of D-norms 
on R? that satisfy ||Z||, = d. By +p, we denote ordinary convergence in 
distribution. 
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Proposition 1.8.3 Let ||-||p,; n E€ NU {0}, be a sequence of D-norms 


on Rt with corresponding generators Z™, n e NU {0}. Then, we have 
the equivalence 


dw (lillo, -lllo ) nso 0 <> Z™ >p Z”. 


We see in Corollary 5.1.13 that weak convergence Z (n) +5 ZO of arbi- 
trary generators, whose components do not necessarily add up to d, implies 
pointwise convergence of the corresponding D-norms. 


Proof. Convergence of probability measures P, to P with respect to the 
Wasserstein metric is equivalent to weak convergence together with conver- 
gence of the moments 


| zl], Pa(de) >n | læll, Po(dæ); 
Sa Sa 


see, for example, Villani (2009). But since for each probability measure P € 
Pp we have 


[ lel, Piz) = | aPla)=4, 
Sa Sa 


the convergence of the moments is automatically satisfied. o 


Convergence of D-norms with respect to the Wasserstein metric implies 
pointwise convergence, which is uniform on compact subsets of R. This is 
a consequence of the following auxiliary result, which provides in particular 
the fact that the pointwise limit of a sequence of D-norms is a D-norm; see 
Corollary 1.8.5. 


Lemma 1.8.4 For two arbitrary D-norms ||-|| p; Illp, on R4, we have 
the bound 


lællp, < lællp, + llæll dw (lillo, > I'll.) 


and thus, forr > 0, 


|llællp, = llællp,| < rdw (Illo, + Illo) - 


oo = 


Proof. Let Z be a generator of ||-||p,, i = 1,2. We have 


ley, = E ( max, (11 20) ) 


= e (yp, (lel (2° +2 - z))) 


36 1 D-Norms 


<E ( max (Izl z)) + ||x||,, E (nas be 7 z?°|) 
ai 1<i<d 


< lello, + lel £ (|Z® - ze), 


which implies the assertion. 


THE POINTWISE LIMIT OF D-NORMS IS a D-NORM 


Lemma 1.8.4 entails that the pointwise limit of a sequence of D-norms is again 
a D-norm. 


Corollary 1.8.5 Let ||-\|p,,n € N, be a sequence of D-norms on R? 
such that 
llzllp,, = fæ) 


lim 
n—-oo 


exists in [0,00) for x € R°. Then, f(-) is a D-norm on R¢. 


Choose an arbitrary norm ||-|| on R? and put $j. := {u E€ R°: u 
0, ||u|| = 1}. By using polar coordinates and writing x = ||a|| (a/ ||x|), x > 
x Æ 0, it is sufficient in the previous result to require the limit 


Ž 
0, 


üm lullo, = f(u) 
to exist for u € Se Then, 


lælp = læ fqæl/ læ), æ € R*\ {0}, 
defines a D-norm on R4, where |æ| = (|z1],.-., |£alļ) is meant componentwise. 


Proof. From Corollary 1.7.2 we know that, for every D-norm ||-||p,, there 
exists a generator Z™) that realizes in S4 := {x >0€R?: |x|], =d}. The 
simplex Sy is a compact subset of Rt, and thus, the sequence Z™, n € N, 
is tight, i.e., for each € > 0 there exists a compact set K in R? such that 
P(Z™ € K) > l—-e for n € N; just choose K = Sg. But this implies 
that the sequence is relatively compact, i.e., there exists a subsequence Z“™, 
m = m(n) that converges in distribution to some rv Z = (Z1,..., Za); see, 
for example, Billingsley (1999, Prokhorov’s theorem). 

One readily finds that this limit Z realizes in Sg as well, and that each of 
its components has expected value equal to one. The Portmanteau Theorem 
implies 

lim sup P (z € Sa) < P(Z € Sa) 
nEN 
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as Sy is a closed subset of R". But P (Z™ € S4) = 1 for each m, and thus, 


P(Z € Sa) = 1 as well. The sequence of components Z (m) n € N, is uniformly 


integrable for each i € {1,...,d} as z realizes in [0,d], and thus, weak 


convergence Zim) >p Zi implies Æ (z™) >n>œ E(Z;); see Billingsley 


(1999, Theorem 5.4). But E a = 1, and thus, we obtain E(Z;) = 1 as 
well for each i € {1,..., d}. 

The rv Z is, therefore, the generator of a D-norm ||-||,,. From Proposi- 
tion 1.8.3, we obtain that dw ([[-Ilp, .Il'Ilp) >n=œ 0. Lemma 1.8.4 implies 
that lællp,, > |lal|p, x € R4, and, thus, f(-) = Ilp- 


THE TOPOLOGY OF POINTWISE CONVERGENCE 


We can define a topology on the set of D-norms on R¢ by defining convergence 
as 
d 
lllo, noo lllo, : = Yz ER: æl, >r% lello- 


This generates the topology of pointwise convergence. This topology is ac- 
tually metrized by the Wasserstein metric dy/(-,-), as the following result 
shows. 


Proposition 1.8.6 For a sequence ||-\[p,,n E€ NU{O}, of D-norms on 
Rt, we have the equivalence 


dw (lillo, > lllo) >n% 0 = Ve ER: [all p, >n> lllo, - 


Metrization of the set of D-norms is crucial in Section 1.11, where we 
take a look at them from a functional analysis perspective. In particular, in 
equation (1.38), we define another metric, which, according to Lemma 1.11.15, 
metrizes the topology of pointwise convergence as well. 


Proof (of Proposition 1.8.6). The implication “=” is immediate from 
Lemma 1.8.4 as follows. For arbitrary x € R¢, « 4 0, we have 


Hell, - ell oe] = liell lac -l 
Tlellp, {lTelell », 
< æl dw (Irllp, vlrllp,) Parse 0. (118) 


Next, we establish the reverse implication. Suppose the sequence of D- 
norms ||-||p,, n € N, satisfies ||æ||p, >n=>œæ ||2||p, for each x € R¢, but 
dw (II-llp, rll-llbg) does not converge to zero as n tends to infinity. Then, 
there is a subsequence Morey n € N, with 
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dw (Ilio ollo) 26 nEN, (1.19) 


for some € > 0. 

By Corollary 1.7.2, every D-norm ||-|| 5, has a generator Z™), n € NU{0}, 
that realizes in Sq = {æ > 0 €R®: ||ax||, = d}. By repeating the arguments 
in the proof of Corollary 1.8.5, there is a further subsequence Z™™)) n €N, 
of ZE) n € N, which converges in distribution to the generator Z € Sq of 
lllo) rn-r00 0 
nc ||æl|p for each z € R? by (1.18). This implies 


a D-norm ||:||p. Proposition 1.8.3 implies that dw (ae 


m(n) 

and, thus, lello 

l-l» =I, and 
dw (Ill dca > Illos) >a 0, 


which contradicts (1.19). 


1.9 Multiplication of D-Norms 


The set of D-norms can be equipped with a commutative multiplication-type 
operation, making it a semigroup with an identity element. This multiplication 
leads to idempotent D-norms. We characterize the set of idempotent D-norms. 
Iterating the multiplication provides a track of D-norms, whose limit exists 
and is again a D-norm. If this iteration is repeatedly done on the same D- 
norm, then the limit of the track is idempotent. 


MULTIPLYING Two D-NORMS 


Choose two generators Z™®, Z°) with corresponding D-norms ||-||p,; Ilp, 
on Rt, and suppose that Z“) and Z are independent. Then, 


Z= ZZ) 


is again a generator of a D-norm, which we denote by ||-||p,p,- Recall that 
all operations on vectors, such as the above multiplication, are meant compo- 
nentwise. The components satisfy ZO ZO) > 0 and, by the independence of 
Z and Z), we also have E (22?) =E (z®) E (2?) = 

Clearly, the multiplication is commutative, ||| p,p, = II'Ilp,p,- The D- 
norm ||-|| 5, p, does not depend on the particular choice of generators. 


Lemma 1.9.1 The D-norm ||-||p,p, does not depend on the particular 


2) 


choice of generators Z“), Z), provided that they are independent. 
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Proof. By P * Z we denote the distribution (P * Z)(-) = P(Z € -) of an rv 
Z. For x € Rt, Fubini’s theorem implies 


Ill. = £ ( max, (lel 202!) ) 
= f # (aps, (late) ) (Po 20) GPa) 


— J |e] (P * z) dz) = E (E ) ; (1.20) 
Do D2 


i.e., ||æl|| p, p, is independent of the particular choice Z (2). Repeating the above 
arguments and conditioning on Z), we obtain the equation 


l2llp,p, = E (=z, (1.21) 


i.e., ||æ||p, p, is independent of the particular choice Z () as well. 


THE SUP-NORM IS THE IDENTITY ELEMENT 


For instance, let ||-||, = Illl: We can choose the constant generator Z) = 
(1,...,1), which is independent of any generator Z“), and clearly obtain 
d 
lælp, p, =llellp,, £ ERS, (1.22) 
ien II-Ilp,p, = II-llp,- The sup-norm ||-||,, is, therefore, the identity element 


within the set of D-norms, equipped with the above multiplication. There is 
no other D-norm with this property. 

Equipped with this commutative multiplication, the set of D-norms on R4 
is, therefore, a semigroup with an identity element. 


Example 1.9.2 The set of Hiisler-Reiss D-norms as defined in (1.6) 
is closed with respect to multiplication, i.e., the product of two Hiisler— 
Reiss D-norms is again a Hiisler-Reiss D-norm. This is an immedi- 
ate consequence of the convolution theorem for (multivariate) normal 
distribution: let X = (X1,..., Xa) and Y = (Y,..., Y4) be indepen- 
dent multivariate normal rv with zero means and covariance matrices 
y= (O45 )1<i,j<d; A= (ij 1<i,j<a- Then, the product of the corre- 
sponding Hiisler-Reiss D-norms ||-||p,, and ||-||R,, has the generator 


ZZ) = exp (x +y- 


ci +À Oda + À 
nt e ee s s it). 


The rv 
X+Y =(X14+%N,...,Xa+ Ya) 
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is again multivariate normal, with mean vector 0 € R@ and covariance 
matrix X + A = (oj; + Aij)1<i,j<a, and thus, the product is again a 
Hiisler—Reiss D-norm. In short notation we have 


Illar nRa = lllar» . 


In particular, choose Y = (Y,...,Y ), where Y is a normal rv on the 
real line with mean zero and variance À > 0. Then, Y is multivariate 
normal with mean vector zero and constant covariance matrix A = AE. 
Equation (1.5) implies 

Illar s = ils 
thus we obtain from (1.22) 


II lla HRe = II-era an = Illus, : 


On the other hand, X/n = (0;;/n)i<i,j<a is, for an arbitrary inte- 
ger n € N, the covariance matrix of X//n = (X1ı/vVn,..., Xa/ vn). 
Therefore, the Htisler—Reiss D-norm is multiplication stable, i.e., 


lalara = |i lirs > meN: 


This example is continued in Example 1.9.7. 


THE ABSORBING D-NORM 


Take Z®) as a generator of the D-norm ||-||,. Then, we obtain from equation 
(1.20) 


d d 
lelov =E (ez) =Y eqz (2®) =Z ld, seră, 
i=1 i=1 
ien |l-llp,p, = lll- Multiplication with the norm ||-||, yields ||-||, again, and 


thus, ||-||, is an absorbing element among the set of D-norms. There is clearly 
no other D-norm with this property. 


IDEMPOTENT D-NORMS 


The maximum-norm ||-||_, and the norm ||-||, both satisfy 


Hlo: := llop = IIb - 
Such a D-norm is called idempotent. Naturally, the question of how to charac- 
terize the set of idempotent D-norms arises. This is achieved in what follows. 
It turns out that in the bivariate case, ||-||,, and ||-||, are the only idempotent 
D-norms, whereas in higher dimensions, each idempotent D-norm is a certain 
combination of ||-||,, and ||-||,- 
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The following auxiliary result is crucial for the characterization of idem- 
potent D-norms. This characterization may be of interest of its own. 


Lemma 1.9.3 Let X be an rv with E(X) = 0, and let Y be an inde- 
pendent copy of X. If 


E(X +Y|) = E(X), 


then either X = 0 or X € {—m,m} a.s. with P(X = —m) = P(X 
m) = 1/2 for some m > 0. The reverse implication is true as well. 


Proof. Suppose that P(X = —m) = P(X = m) = 1/2 for some m > 0. Then, 
obviously, 
E(\|X|) =m = E(|X +Y)). 

Next, we establish the other implication. Suppose that X is not a.s. the con- 
stant zero. Denote by F the df of X. We can assume wlog the representation 
X = F-1(U,), Y = F~'(U2), where U1, Uz are independent, on (0,1) uni- 
formly distributed rv, and F~+(q) := inf {t ER: F(t) > q}, q € (0,1), is the 
generalized inverse of F. The well-known equivalence 


F(q) <t = q< F(t),  q€(0,1),tER, 


(see, for example, Reiss (1989, equation (1.2.9))) together with Fubini’s the- 
orem imply 


E(|X +Y|) = E (|F71 (U1) + F7 (U2)|) 


-f f |F- (u) + F71(v)| du dv 
o Jo 
F(0) pF(0) 
-f 1 F~! (u) + F7! (v) du dv 
0 0 
1 
+f F~! (u) + F7! (v) du dv 
F(0) J F(0) 


F(0) pl 
+2 f | |F (u) + F7*(v)| du dv 
0 F(0) 


+ foo ror OES or an) dv 


F(0) 
+2 f fo + F~*(v)| du dv 
on 


F(0) 
= -2F0) | F~! (v) dv 
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and 


From the assumption E(|X + Y|) = E(|X|), we thus obtain the equation 
F(0) 
0=(1- 2F(0)) f F! (v) dv 
0 
1 
+(1 -2F0)) | F+! (v) dv 
F(0) 
F(0) pl 
+2 f | |F-*(u) + F-*(v)| du dv 
0 F(0) 
or 
1 
0=(1- 2F(0)) f F~! (v) dv 
0 
F(0) pl 
+ 2 | | |F! (u) + F*(v)| du dv. 
0 F(0) 
The assumption 0 = E(X) = A F~1(v) dv now yields 


F(0) pl 
| I |F! (u) + Ft (wv)]| dudv = 0 
0 F(0) 


thus, 


F(u) + Ftv) =0 (1.23) 
for A-a.e. (u,v) € [0, F(0)] x [F'(0), 1], where A denotes the Lebesgue measure 


on [0, 1]. 


If F(0) = 0, then P(X > 0) = 1, and thus, E(X) > 0, which would be a 
contradiction. If F(0) = 1, then P(X < 0) > 0 unless P(X = 0) = 1, which 
we have excluded, and thus, E(X) < 0, which would again be a contradiction. 


Consequently, we have established 0 < F(0) < 1. 


As the function F~'(q), q € (0,1), is in general continuous from the left 
(see, e.g., Reiss (1989, Lemma A.1.2)), equation (1.23) implies that F'~!(v) is 


a constant function on (0, F(0)] and on (F'(0), 1), precisely, 


ina J-m, ve (0, F(0)], 
= w= fr v € (F(0),1), 
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for some m > 0. Note that the representation X = F~'(U;), together with the 
assumption that X is not a.s. the constant zero, implies m Æ 0. The condition 

F(0) 1 
0= E(X)= f F~} (v) du + F=! (v) dv = m(1 — 2F(0)) 
0 F(0) 


implies F'(0) = 1/2 and, thus, 


which is the assertion. 


IDEMPOTENT D-NORMS ON R? 


The next result characterizes bivariate idempotent D-norms. 


Proposition 1.9.4 A D-norm ||-||, on R? is idempotent iff ||-\|) € 


dla» lllo} 


Proof. It suffices to establish the implication 


lllo = Hlo Illo # lilla lllo = llla- 


Let ZY) = (Z®, ZP) and Z® = (Z{®, ZP) be independent and 
identically distributed generators of ||:||p. According to Corollary 1.7.2, we can 
assume that Z® + ZP = 2 = Z® + ZP. Put X := Z® —1,V := Z® — 1. 
Then, X and Y are independent and identically distributed with X € [—1, 1], 
E(X) = 0. From the equation max(a, b) = (a +b)/2+ |b — a| /2, which is valid 
for arbitrary numbers a, b € R, we obtain the representation 


E (max (Z®Z®, zoa) 


(1) 7(2) (1) 7(2) 
Zy Z LoL. 1 


=1+ E (|Z -1+Z® -1l 
=1+E(\X+Y)) 


as well as 
E (max (2, z®)) =1+ E(X). 


Lemma 1.9.3 now implies that P(X = m) = P(X = —m) = 1/2 for some 
m € (0, 1]. It remains to show that m = 1. 
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Set x = 1 and y = a, where 0 < a < 1 satisfies a(1 + m) > 1 — m. Then, 
a(1 +m)? > (1— m)? as well, and we obtain 


I(2,v) lp» = E (max (212), a (2 - z) (2- 21))) 
= Í max ((1 — m)’, a(1 + m)’) 
+ Í max (G +m)’, a(1 —m)’) 


+ Í max (1 — m?,a(1 — m?)) 


1 1 
= qa +m)? + 7 +m)? +=(1—m?) 


NI =| 


= F(1 + m}?(1+ a) + (1 — m?) 
and 
I(@.Dllp = B (max (Z®,a (2- Z®))) 
= Í max(1 +m,a(1—m)) 


1 
+ 5 max(1— m,a(1+m)) 


NLR Mele 
— 
ay 
j! 


tm) + Lal +m) 


— 
jmi 
| 


+ m)(1 +a). 


From the equality ||(z,y)||p2 = || (x, y)l|p and the fact that 1+m > 0, we 
thus obtain 


T(t + m)(1 +a) + 5(1—m) = F +a) 
<=> (m-—1)(a—1)=0 


<> m=], 


which completes the proof. 


IDEMPOTENT D-NORMS ON R@ 


Next, we extend Proposition 1.9.4 to arbitrary dimensions d > 2. Denote 
again by e; the i-th unit vector in R, 1 < i < d, and let ||-||p be an arbitrary 
D-norm on R’. Recall that, for 1<i<j<d, 


læ yllo, ; =lIter+yellp, (jy) eR’, 


defines a D-norm on R°, called the bivariate projection of ||-||p. If Z = 
(Z1,...,Za) is a generator of ||:||p, then (Z;, Z;) generates ||-||p, ,- 
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Proposition 1.9.5 Let ||-||, be a D-norm on R? such that each bi- 
variate projection ||:||p, , is different from the bivariate sup-norm ||-||,,- 


Then, ||-||p is idempotent iff |o = Illl- 


Proof. If ||-|| 5 is idempotent, then each bivariate projection is an idempotent 
D-norm on R?; thus, each bivariate projection is necessarily the bivariate D- 
norm ||-||, by Proposition 1.9.4. In other words, ||e; + e;|| , = 2 for 1 < i < 
j < d. Corollary 1.3.5 now implies ||-|| 5 = Illl- 


COMPLETE DEPENDENCE FRAME OF A D-NORM 


If we allow bivariate complete dependence, i.e., ||-|| b= Illo (see the com- 
ments after Theorem 2.3.3), then we obtain the complete class of idempotent 
D-norms on R? as mixtures of lower-dimensional ||-||,.- and |]-||,-norms. To 
this end, we first introduce the complete dependence frame of a D-norm. 
Let ||-|| be an arbitrary D-norm on R? such that at least one bivariate 
projection ||-||p,, equals ||-||,, on R?. Then, there exist non-empty disjoint 
subsets Aj,...,AxK of {1,...,d}, 1 < K < d, |Ak| > 2,1 < k < K, such that 


X Tiei 


iC Ak 


= max |2| , xeERt1<k<K, 
tC A, 


D 


and no other projection [Ees ee; || p B c {l,...,d}, |B| > 2, B Æ Ag, 
1 < k < K, is the sup-norm ||-||,, on RIBI, We call A1,..., Ag the complete 
dependence frame (CDF) of ||-|| p. If there is no completely dependent bivariate 
projection of ||-||,,, then we say that its CDF is empty. 

The next result characterizes the set of idempotent D-norms with at least 
one completely dependent bivariate projection. 


Theorem 1.9.6 Let ||-||,, be an idempotent D-norm with non-empty 
CDF Aj,..., Ax. Then, we have 


K 
lælp = X max |a,| + Se Izi], 2 €R*. 
i 


ic{1,.. d }\UŽ Ak 


On the other hand, the above equation defines an idempotent D-norm 
on R? with CDF Aj,..., Ax for each set of non-empty disjoint subsets 
Aj,..., ÅK Op lea with |Ax| 22; 1<k<K<d. 


Proof. The easiest way to establish this result is to use the fact that G(x) := 
exp(— ||x||,5), £x < 0 € R®, defines a df for an arbitrary D-norm ||-|| on R°. 
This is the content of Theorem 2.3.3 later in this book. 
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Therefore, let n = (m,..-,7a) be an rv with this df G. Then, for æ < 0 € 
Rt, we have 


G(x) = exp (— ||z||p) 
= Pn <r, 1<i< d) 


C 
: 7 ; K 


where k* € A, is an arbitrary but fixed element of Ay, for each k € {1,..., K}. 
The rv 7* with joint components ny, 1 < k < K, and nj, j € (UK, Ax)’, is 
an rv of a dimension less than d, and 7* has no pair of completely dependent 
components. The rv 7* may be viewed as the rv 7 after having removed the 
copies of the completely dependent components. Its corresponding D-norm is, 
of course, still idempotent. From Proposition 1.9.5, we obtain its df, i.e., 


K 
G(x) = exp -> ain zil — 5 |x; | 
k=1 je(UK Ar)? 
K 
= exp | — max |x;| — 5 laj| |, x<0€R?, 
k 
k=l je(UE, An)” 


which is the first part of the assertion. 
On the other hand, take an rv U that is uniformly distributed on the set 


of integers {k*: 1 < k < K}U (UK, Ae)”. Put m := K + (UE A)", and 


set for i =1,...,d 
rea 
Z mM, te iy 
0 otherwise, 


ifU =k*,1<k<K, and 


m, i=j, 
Zi = 
0 otherwise, 
E c , 
ifU= J7€ (Uk, Ax) . Then, E(Z;) =1,1<i<d, and 


E (nas (|x| z,)) 


1<i<d 
=- XB aay thn 29100 =) 
JE {k*:1Sk<K}U(UK_, An)” 

K 

= ; ; R£. 
D >> lal, we 


je (UK, Ar)" 
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It is easy to see that this D-norm is idempotent, and thus, the proof is com- 
plete. 


The set of all idempotent trivariate D-norms is, for example, given by the 
following five cases: 


max(|æ|, yl, l2), 
max(|a],|y|) + |z|, 

[æy zll = y max(lzl, |z) + lul, 
max(|yl , |z|) + |æļ, 
lel + lyl + lI, 


where the three mixed versions are just permutations of the arguments and 
may be viewed as equivalent. 


TRACKS OF D-NORMS 


The multiplication of D-norms D1, D2,... on R? can obviously be iterated: 


llig: Di = Ilona M Di? ne N. 


In what follows we investigate such D-norm tracks Ili, po n € N. We 
show in particular that each track converges to an idempotent D-norm if 
Illo, = W-llp» i €N, with an arbitrary ||- ||p-norm D on Rê. 


Example 1.9.7 (Continuation of Example 1.9.2) Recall that any 
positive semidefinite d x d-matrix X is the covariance matrix of a multi- 
variate normal rv with mean vector zero, and thus it defines the Hiisler— 
Reiss D-norm ||-||;p,,- By the principal axis theorem, there exists an 
orthogonal d x d-matrix O, i.e., the transpose OT of O is the inverse 
O`t of O, such that 


OT,” 


0 Xd 


where A; > -+ > Aa È 0 are the eigenvalues of X; see, for example, 
Lang (1987). This equation implies the spectral decomposition of X 


E = Mrr] +---+Aarart, 


where r; € Rf is the i-th column of the matrix O, 1 < i < d. Clearly, 
each product r;r{] is a positive semidefinite d x d matrix, and thus the 
Hiisler-Reiss D-norm ||:||yr, is the product of the norms lllar, 


ISi sd: 


Pes 
tr 
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In short notation, we have 


I ‘lary =|| ‘In, HR 


z 
Arar} 


One may call this representation the spectral decomposition of the 
Hiisler-Reiss D-norm ||-|| yp ,- 


We establish several auxiliary results in what follows. The first one shows 
that the multiplication of two D-norms is increasing. 


Lemma 1.9.8 For two arbitrary D-norms ||-|| p,, \|-||p, on R?, we have 
the inequality 


lllo, p, 2 max (Il-llp, +lllb,) - 


Proof. Let Z) and Z@) be independent generators of |\-||p, and ||-|| p,- By 
equation (1.21), we have for x € R@ 


ielo,v; = £ (ezel) (1.24) 


Note that 


zl, = nz (z)|| = |E (az) (1.25) 


where the expectation of an rv is meant componentwise, i.e., E (Z®) = 
(6(2®),....B (22), ete Put 


T(x) :=|lzllp,, z ER. 


lo i 


Check that T is a convex function by the triangle inequality and the homo- 
geneity satisfied by any norm. Jensen’s inequality states that a convex function 
T : R? > R entails T(E(Y1),..., E(Ya)) < E(T(“1,..-, Ya)) for arbitrary in- 
tegrable rv Y1,..., Ya. We thus obtain from Jensen’s inequality together with 
equations (1.24) and (1.25) 


= 2 
ielan = (lezel) 


=» ( (22) 
>r (2 (22) 


S 


= |lzIlp, - 


Exchanging Z“) and Z?) completes the proof. o 


lo, 
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THE LIMIT OF A TRACK IS A D-NORM 


Proposition 1.9.9 Let ||-||,,, n € N, be a sequence of arbitrary D- 
norms on RÌ. Then, the limit of the track 


lim lell, p: = f(e) 


exists for each x € R? and is a D-norm, i.e., f(-) = ||-l|p- 


Proof. From Lemma 1.9.8, we know that, for each æ € R? and each n € N, 
Helge, n, < lele n, 


As each D-norm is bounded by the norm ||-||,, we have æl, p: < |lxll, 
for each n € N. Consequently, the sequence lle, p, n € N, is monotone 
increasing and bounded; thus, the limit 7 


lim elle, D: =: f(a) 


n— oo 


exists in [0,00). As the pointwise limit of a sequence of D-norms is a D-norm 
by Corollary 1.8.5, f(-) is a D-norm as well. 


IDEMPOTENT LIMIT OF A TRACK 


If we set ||-||p, for each n € N equal to a fixed but arbitrary D-norm ||-||p, 
then the limit in Proposition 1.9.9 is an idempotent D-norm. 


Theorem 1.9.10 Let ||-||,, be an arbitrary D-norm on R¢. Then, the 
limit 


lim |[2[Ipn =: lello», z ERY, 


is an idempotent D-norm on R¢. 


Proof. We know from Proposition 1.9.9 that ||-|| p. is a D-norm on R¢. Let Z* 
be a generator of this D-norm, and let Z®, Z), ... be independent copies 
of the generator Z of ||-|| 5, independent of Z* as well. Then, for each x € R¢, 


we have 
Z| pn = e( a[ [Zz tn00 |æ] p- 
i=1 A 


according to Lemma 1.9.8 and Proposition 1.9.9, as well as for each k € N 
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k 
Itllp» =£ | le] [Z® J] 2 


i=1 j=k+1 
k n 

= fE x][2 J] 2 
i=l j=k+1 


i=l D* 
k . 

= ( 2Z* | [Z® ) 
i=1 os 


by the monotone convergence theorem. We thus have 


k 
lelo- = e( 2Z* J [z0 ) 
i=1 = 


for each k € N. By letting k tend to infinity and repeating the above argu- 
ments, we obtain 


lællp =£ ( 


which completes the proof. 


k 
aZ*|[Z 
i=1 


Tk>o E (l£ Z" p) = lello > 


Co 


Recall that the multiplication of D-norms is, according to Lemma 1.9.8, 
a monotone increasing operation. Together with the fact that ||-||_, is the 
smallest D-norm (see equation (1.4)), we can draw the following conclusions. 
If the initial D-norm ||:||p has no complete dependence structure among its 
margins, i.e., if its CDF is empty, then the limiting D-norm in Theorem 1.9.10 
is, by Proposition 1.9.5, the independence D-norm ||-||,. Otherwise, the limit 
has the same CDF as ||-||p. 

The limit of an arbitrary track || ‘In, p; n EN, is not necessarily idempo- 
tent. Take, for example, an arbitrary D-norm ||-|| p,» Which is not idempotent, 
and set lp, = llla # > 2 


AN APPLICATION TO COPULAS 


Let the rv U = (U1,..., U4) follow a copula, i.e., each component U; is uni- 
formly distributed on (0,1). As E(U;) = fy udu = 1/2, the rv Z := 2U 
generates a D-norm; see also the discussion on page 152. The following result 
is an immediate consequence of the previous considerations. 
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Corollary 1.9.11 Let U),U®),... be independent copies of the rv 
U that follows an arbitrary copula on RÊ. Suppose that no pair Ui, U;, 
i Æ j of the components of U = (Ui,...,Ua) satisfies U; = Uj a.s. 
Then, for each x € R¢, 


n d 
n i (2) = : 
ty e ( mex, (ie IL )) Se lel 
J= 


i=j 
With x = (1,...,1), we obtain 


TEN Tay \ \ 
Jim 2 E (2 (i U; )) =d. 


i=l 


AN APPLICATION TO MULTIVARIATE NORMAL RANDOM 
VECTORS 


Let the rv X = (Xj,..., Xa) follow a multivariate normal distribution with 
E(X;) = 0 and covariance matrix X = (0;;)1<i,;<a- Then, 


Z = (Zi. Za) = (exp (Xi - S),...,exp (Xa - S4)) 


generates the Hiisler-Reiss D-norm ||: |y: In what follows, we require that, 
for i # j, each correlation coefficient pi; = E(X;X;)/./7uo;7 is strictly less 
than one. 


Corollary 1.9.12 Let X“, X@),... be independent copies of X. For 
each x € R, we obtain 


S yl) nO 
E (+ (2, = Xj 5 + eet) )) 


= T (9 _ 233) 
= e ( pas, (Je (x 2 )) 


d 
j=l 


With x = (1,...,1) and identical variances oi =- 
we obtain in particular 


exp(no?/2) my 
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1.10 The Functional D-Norm 


This section extends D-norms to function spaces. In particular, this entails 
an appealing approach to functional extreme value theory in Chapter 4. 


SOME BASIC DEFINITIONS 


By C [0,1] := {g : [0,1] > R, g is continuous}, we denote the well-known set 
of continuous functions from the interval [0,1] to the real line. By E[0, 1] we 
denote the lesser known set of those bounded functions f : [0,1] + R with 
only a finite number of discontinuities. Note that [0,1] is a linear space: if 
fı, fo € E[0, 1] and z1, £2 E R, then rf; + v2 fo € E[0, 1] as well. 

We introduce the set E[0,1] because it allows the incorporation of finite 
dimensional marginal distributions of a stochastic process with a proper choice 
of f € E[0, 1], as we see later in equation (1.29). 

Let Z = (Zt),¢[9,1; be a stochastic process on [0,1], i.e., Z is an rv for 
each t € [0,1]. We require each sample path of (Zt) t€[0,1] to be a continuous 
function on [0,1], Z € C [0,1] for short. We also require that 


Z220, E(Z)=1, te (0,1), 


E ( sup z1) < oO. 
0<t<1 


Lemma 1.10.1 Under the above conditions on the process Z = 
(Z:)te[o,1); 


(O12); fe ED, 


Llp = B( 


sup 
O<t<1 


defines a norm on E|0, 1]. 


Proof. We, obviously, have ||f||p > 0 and 


(F010) 


sup Z: 
te [0,1] 


Iso =E( 


sup 
0<t<1 


aN 
lœ 5 
Ae 
S 
a) 
z= 
a. 
oe 
= 
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Let ||f|| 5) = 0. We have to show that f = 0. Suppose that there exists some 
to € [0,1] with f(to) 40. Then, 


0=|Ifllp 
=E ( sup (|f(t)| z) 


te[0,1] 
2 E(|f(to)| Zto) 
= |f (to)| E(Zio) 
= |f(to)| > 0, 


which is a clear contradiction. We have thus established the implication 
Iflp=0 = f=0. 


The reverse implication is obvious. Homogeneity is obvious as well: for f € 
E[0,1] and A € R, we have 


IMllp = E ( (arte) z:)) 


sup 

0<t<1 

=E (a sup (|f(t)| 21) 
0<t<1 

=|\NE ( sup (|f(#)| 21) 
0<t<1 


= JAIZ - 


The triangle inequality for ||-||,, follows from the triangle inequality for real 
numbers |x + y| < |z| + |y|, x,y € R: 


If + fallp sal TA 1) + fld| 2) 
0<t<1 


< B( sup ((filt)| Ze + Lfa(0) 21) 


e ( sup ((falt)| Ze) + sup (f2) 21) 


= ( sup (|fi 1) +E ( suv (let) 21) 


0<t<1 
=|lfillyp +Ilfell. fi, f2 € E [0,1]. 
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MEASURABILITY OF THE INTEGRAND 


Note that, for each f € E[0,1], (f()2Z¢),¢[0,1) is a stochastic process whose 
sample paths have only a finite number of discontinuities, namely those of the 
function f. The finite set of discontinuities of the process (f(t) Z:)+ejo,1] is, con- 
sequently, non-random. This entails the measurability of sup,ejo,1) (Jf (t)| Z+): 
we can find a sequence of increasing index sets Ta = {t1, ... tn} C [0,1], n € N, 
containing all discontinuities of f for n large enough, such that 


sup (\F(0| Zs) = lim, (max (EZ) 
teE[0,1] 1<i<n 

As maxi<i<n (|f(t:)| Zt) is an rv for each n € N, the limit of this sequence, i.e., 
supzejo,1] (If (¢)| Z+), is an rv as well. We can therefore compute its expectation, 
which is finite by the bound 


ee (FOLZ = IFZ < a p (FEE ee ll WA loc 


and taking expectations. Recall that each function f € E[0,1] is by the def- 
inition of [0,1] bounded. The process Z = (21) se(0,1] is again called the 
generator of the D-norm ||-||p.- 


THE FUNCTIONAL SUP-NORM IS A D-NORM 


The functional sup-norm || f||,, = supyejo1)(/f(I), f € ZO, 1], is a functional 
D-norm. Just choose an rv X > 0 with E(X) = 1, and set Z = (Z;)te(0,1) 
with Z, = X, t € [0,1]. Then, clearly, the corresponding functional D-norm is 


Illo =£ (sp z) = E(lfll.o*) = Ilfo f€ E[0, 1). 


This example shows that the generator of a D-norm is also not uniquely 
determined in the functional setup. 
The functional sup-norm ||-||,, is again the smallest D-norm 


Iflloo < Ifilo; Ff € E[0, 1], 


(see Lemma 1.10.2 below), but unlike the multivariate case, there is no in- 
dependence D-norm in the functional setup. Suppose there exists a D-norm 
with generator Z = (Zt)tejo,1] such that 


d 
Hl a. = E ( pg (hel 2) = D leil 


į=1 


for any choice 0 < ty < +++ < ta < 1 of indices and æ = (z1,..., £4) € Rf, d € 
N. Then, by the continuity of Z = (Z:)tejo,1] and the dominated convergence 
theorem, we obtain for the constant function 1 
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lp -2( sup “| 
te [0,1] 


=E ( lim max Zn) 
n= l<i<n 


= lim E ( max Zum) 


n— o0 1<i<n 
n 
E D 
i=1 
thus, a functional independence D-norm does not exist. Furthermore, no func- 
tional Lp-norm is a D-norm; see Corollary 1.10.4 below. 
BOUNDS FOR THE FUNCTIONAL D-NORM 


All norms on R? are equivalent, i.e., for two arbitrary norms ||-|\,, ||-||, on R? 
there exists a constant K > 0, such that 


Ie]; < Kllællz; æ € R. 


This is no longer true for arbitrary norms on E[0,1]. But it turns out that 
each functional D-norm is equivalent to the sup-norm || fllo = suPze;o,1) IFE) 
on E[0, 1]. 


Lemma 1.10.2 Each functional D-norm is equivalent to the sup-norm 
lllo precisely, 


IF SMF lla < Ifl Wt, f€ EI0, 1). 


Proof. Let Z = (Zt),¢19,1; be a generator of ||:||p. For each to € [0,1] and 
f € E[0,1], we have 


lf (to)| = E (IF (to)| Zeo) 
<E ( sup (|f(¢)| z) 


te[0,1] 


= IIfllp 
< E (fll lZ) = Ifl lHo > 


which proves the lemma. 
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Corollary 1.10.3 For f,g € E[0,1], we have the bound 


Ufo —llgllp | < IF = gll Ilo - 


Proof. As ||-|| 5 is a norm, it satisfies the triangle inequality 


IIfllpb < IF = sll» + llgllo - 


Lemma 1.10.2 now implies 


IIfllp —Ilgll> < IF- 9l» SIF = sll Illo - 


Exchanging f and g implies the assertion. o 


FUNCTIONAL Lp-NORMS ARE NoT D-NORMS 


Different than the multivariate case, a functional logistic norm is not a func- 
tional D-norm. 


7 
Corollary 1.10.4 No norm || f|], := ie | f (t)|? dt) with p € [1, co) 


is a D-norm. 


Proof. Choose e € (0,1) and put fe(-) := 1)o,<j(-) € [0,1]. Then, || fz||,, = 
1> t = |j fell p: The Lp-norm, therefore, does not satisfy the first inequality 
in Lemma 1.10.2. o 


A FUNCTIONAL VERSION OF TAKAHASHI’S THEOREM 


The next consequence of Lemma 1.10.2 is obvious. This is a functional ver- 
sion of Takahashi’s Theorem 1.3.1, part (ii). Note that there cannot exist an 
extension of part (i) to the functional case, as ||-||, is not a functional D-norm 
according to the preceding result. 


Corollary 1.10.5 A functional D-norm ||-|| 5 is the sup-norm ||- ||- iff 


lb = 1. 


EXAMPLE: THE BROWN-—RESNICK PROCESS 


A nice example of a generator process is the geometric Brownian motion 


t 
Zı := exp (2 — 3 ; t € [0,1], (1.26) 
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where B := (B;)i>0 is a standard Brownian motion on [0,00). The corre- 
sponding maz-stable process is a Brown—Resnick process (Brown and Resnick 
(1977)); see Section 4.2. 

The characteristic properties of a standard Brownian motion B are that 
it realizes in C[0, 1], Bo = 0 and that the increments B;— B, are independent 
and normal N(0,t — s) distributed rv with mean zero and variance t — s, 
formulated a little loosely. As a consequence, each B, with t > 0 follows the 
normal distribution N (0, t) with mean zero and variance t. We have, therefore, 


Z > 0, te [0,1], 


and, for t > 0, 


(Zs) = exp (=£) Blexp(Bo) 
i (-5) E (exp (2 )) 
a (-5) is expli! 22) arg os ( >) de 


=" (1.27) 


as exp (—(x — t'/?)?/2) /(27)!/?, x € R, is the density of the normal 
N(t!/2, 1)-distribution. 
It is well known that, for x > 0, 


P| sup Bk > 2x) =2P(B, > 2), 
te [0,1] 


which is called the reflection principle for the standard Brownian motion; see, 
for example, Revuz and Yor (1999, Proposition 3.7). From this equation and 
the representation of the expectation of an rv in Lemma 1.2.2, we obtain 


E ( sup z) <E ( sup (exp o) 
te[o,1] te[0,1] 
= E | exp | sup Br 
te[0,1] 
= P | exp | sup B | >x] dz 
0 te [0,1] 


=< 1+ f P ( sup B, > et) dr 
1 


te[0,1] 


58 1 D-Norms 
=1+2 i PCB Sloxteide 
1 


<1+ 2 | P(exp(B) > x) dx 
0 


= 1+ 2E(exp(B;)) 
< Ww, 
as exp(B,) is standard lognormal distributed with expectation exp(1/2). 


The exact value of the complete D-norm ||f||p is unknown for arbitrary 
f, but we can compute the bivariate D-norm 


II(z,Wlo,, = E (max(|2| Zs, |yl4)), tye R,O<s<t<l. 


This knowledge is sufficient, for example, to reconstruct a Brown—Resnick 
process by means of a max-linear model, as in Section 4.3. 


Lemma 1.10.6 In the case of the Brown—Resnick standard maz-stable 
process with a standard geometric Brownian generator process, we have 


(@,Wlp,, = ble (= J eted 
V3 , esl /le)) 


+|y|& 
wie(S v 


forz,y ER and0 < s<t< 1, where ® denotes the standard normal 
df on R. 


An inspection of the proof of Lemma 1.10.6 shows that the restriction 
s,t < 1 can be dropped. 

Note that ||:||p, , in the preceding lemma equals the bivariate Hüsler-Reiss 
D-norm ||-||;7R,,, with covariance matrix 


B= (oats? as") = (2). 


Lemma 1.10.6 can be extended to zero means Gaussian processes with sta- 
tionary increments; see Kabluchko et al. (2009, Remark 24). For the trivariate 
case, we refer to Huser and Davison (2013). 


Proof (of Lemma 1.10.6). We provide quite an elementary proof, which uses 
the independence of the increments of a Brownian motion. We have, for 0 < 
s<tandz,y>0, 


læ vlo., 
= E(max(xZ,, yZ)) 
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Recall that the increments B; — Bs, Bs = Bs — Bo of a standard Brownian 
motion are independent and normal distributed with means zero and variances 
t—s and s. As a consequence, we obtain by equation (1.27) 


1 = eeoa (1 (B.n < t2 soe (2))) 
p(B. <3 (5) 


7 (t=s)/2+log(s/y) 4 u 
=h) =( =) du 
T 


y(u) du 


s vt—s gd 
-e (3) 0( 5 + ee) 


where y denotes the standard normal density on the real line. By repeating 
the above arguments, we obtain 


HI=E (ewB,) exp(B; — Bs)1 (2. Z Bs > > ee ( ))) 


x 
y 


= E(exp(Bs))E (cB, = 8, )1 (2 =B; > = Te ( ))) 


x 
y 


To exp(u) ie (g) atu 
— Xp = TT 
2 (t—s)/2+log(«/y) t=s a 


The equation 


exp(u)y (+) = = exp(u) exp (Ca) 
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= oe (Es st) 


implies 
t ii u-— (t-s) 
IT = exp OL Le( t3) du 
3) deam Jie AN NA 
t CoO 
= exp (5) | (uw) du 
2/ J (log(«/y)-(t-s)/2)/Vt=s 
t 
2 


on 
= 


GE) 


Ji-s 2 
eolie (E e) 


by appropriate elementary substitutions and the equation 1 — (u) = &(—u), 
u € R. The assertion is now a consequence of the equation 


£ exp (-5) I+yexp (-5) I 


DUAL D-NORM FUNCTION 


We can also extend the multivariate dual D-norm function in (1.11) to func- 
tional spaces by setting 


V ftp := B (ing inf (Pe )1Z:)) j f € E[0,1]. (1.28) 


As in the multivariate case in E 1.6.3, the value of X f 2p does not 
depend on the particular process Z = (Z:)rejo,1]; that generates the functional 
D-norm ||-||p- 


Lemma 1.10.7 Let Z = (Z;)tejo,1) and Z= (Zt) e€{0,1) be two genera- 
tors of the functional D-norm ||-|| 5. Then, 


B (int, (FOIZ) =B( int (OZD); Fe 201 


teE [0,1] 


1.10 The Functional D-Norm 61 


Proof. Choose f € E[0,1]. As before, we can find a sequence of increasing 
index sets Tn := {t1,...,tn} C [0,1], n € N, such that 


inf (|f(t)| Z) = lim min (|f(t)| Ze) 


te [0,1] noo 1l<i<n 


and 


f (f@)|Z) = lim min (I)a) 


elo, 1] n—-oo l<i<n 


But for each n € N, (Zn); and (Za), are generators of the same 


D-norm llo, on R”, as they satisfy for x = (z1,..., £n) € R” 


R A 


E ( max (|æ:| Ze, )) =E (2, see (| fae (t)| zo) 


1<i< 
=E a (I®l 2) 
E (2a (lz zn) : 


noia fi=4 = T fe (0,1, (29) 
i=1 


0 elsewhere 


which defines a function in E[0, 1]. 
Since V £ VD, + does not depend on the generator of ||-| Da 
Corollary 1.6.3, we have, for each n € N and æ = (z1,..., £n) € R”, 


E (i, (|xi| Ze, ) =E (in, (lz: va )) . 


The monotone convergence theorem now implies 


E 


(int (OIZ) 


te[0,1] 


lI 
B 
& 
a 
z 
B 
S 
a 
2 
N 
a 
S a 


“Baa (002) 


= E ( int, (12012) ): 
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It is easy to construct a generator of a functional D-norm such that the 
corresponding dual D-norm function is zero for each f € E[0, 1]. Choosing 
Z = (Z;)tejo,1] as the constant function 1, we obtain from the arguments in 
(1.14) that 

Ufo = min |f(t)|,  f € E[0,1), 
te [0,1] 


is the largest functional D-norm, i.e., we have for an arbitrary functional 
D-norm the bounds 


OAS teas te = min |F, f € E[0,1]. 
te[0,1] 


A NORMED GENERATORS THEOREM 


We have established in Theorem 1.7.1 the fact that, for any D-norm ||-|| on 
R? and for any norm ||-|| on R%, there exists a generator Z of ||-||,, with the 
additional property that ||Z|| = const. The following result can be viewed as 
a functional analog of this normed generators theorem. For a proof, we refer 
to de Haan and Ferreira (2006, equation (9.4.9)). 


Theorem 1.10.8 (De Haan and Ferreira) For an arbitrary func- 
tional D-norm ||-||p, there exists a generator Z = (Z;)rejo,1) with the 


additional property sup Z = const for some const > 1. 
te [0,1] 


1.11 D-Norms from a Functional Analysis Perspective 


In this section, seminorms play a crucial role. 


Definition 1.11.1 A function ||-||, from R? to [0, 00) is a seminorm if 
it is homogeneous of order one and if it satisfies the triangle inequality, 


i.e., if it satisfies conditions (1.2) and (1.3). Different than a norm, 
condition (1.1) is not required. 


We can generate a seminorm by means of an rv Z = (Z1,..., Za) > 0 ER? 
with E(Z;) < co, 1 < i < d, by defining 


_ fe = d 
llælls := Æ (ma, (l z;)) ; x = (z1,... £a) E R®. (1.30) 


Note that we use a capital letter © in the index for such a seminorm, which 
is defined by a generator Z. The above definition is quite close to that of a 
D-norm in Lemma 1.1.3; the difference is that ||æ]||¢ = 0 does not necessarily 
imply æ = 0 in (1.30), as we allow E(Z;) = 0 for some j € {1,...,d}, i.e., Zj = 
0 a.s. In this case, we obtain for the unit vector e; = (0,...,0,1,0,...,0) € R? 
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lleglly = E(Z;) = 0. 


The seminorm ||:|| ç is, consequently, a norm iff E(Z;) > 0 for all j = 1,...,d, 
with the special case of it being a D-norm iff E(Z;) = 1 for all j. 

Ressel (2013, Theorem 1) characterized the set of seminorms as defined 
in (1.30) when the generators Z realize in S = {æ > 0 €R°: |la||,, = 1}. 
This characterization is achieved in the setup of functional analysis. The set 
of seminorms turns out to be a Bauer simplex, whose extremal elements are 
the seminorms with a constant generator. The aim of this section is to extend 
this characterization in Theorem 1.11.19 to the case where the generators Z 
all realize in an angular set, defined below. 

As a consequence of our considerations we show in particular in Propo- 
sition 1.11.20 that the set of D-norms, whose generators follow a discrete 
distribution on the set Sy = {æ >0€ R*: ||a||, =d}, is a dense subset of 
the set of all D-norms. 

Before we can present the results, we have to introduce various definitions 
and auxiliary results. 


Definition 1.11.2 A subset S C [0,00)¢ is an angular set if it satisfies 
the following conditions: 


(i) S #0 is compact, 


(ii) O€S, 
(iii) For any x € S and any A E€ R, we have às € S iffX=1. 


An angular set in R? is, for example, Sı := {(u, 1— u’): we (0, 1J}; see 

the discussion after Theorem 1.7.13. The set 
S2 := {(u, 1/u) u> 0} U {(0, 1), (1, 0)} 
is not an angular set in R? in the sense of Definition 1.11.2 as it is not compact. 
The set 
S3 := {(u,1 — u) : u € [0,1/2)} 

is an angular set in R?, but not a complete one as in Definition 1.7.12, since 
not every (x,y) > 0 € R? can be represented as (x,y) = A(u,1 — u), with 
some À > 0 and some u € [0,1/2]. The vector (3/4, 1/4), for example, cannot 
be represented this way. 


INTRODUCING THE RELEVANT SPACE OF SEMINORMS 


Throughout the rest of this section we suppose that S C [0,00)¢ is an angular 
set as in Definition 1.11.2. By Kgs we denote the set of seminorms on R4 
generated by means of S: 


Ks := { \|-Ig : there is a rv Z = (Z1, ... , Za) that realizes in S, with 


= i i d 
ells = E( max, (lel 2) ) for alla € R \. (1.31) 
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Repeating the arguments in the proof of Proposition 1.4.1 yields the con- 
vexity of the set Kg. 


Lemma 1.11.3 The set Ks is conver, i.e., if |l-Ilg1, Iblis are semi- 
norms in Ks, then X||-|5, + -= A) Ills € Ks for any A € [0,1] as 


well. 


Choose an arbitrary seminorm ||-||, in Ks, i.e., there is a generator Z that 
realizes in the angular set S. Next, we establish the fact that the distribution 
of Z is uniquely determined. This parallels Corollary 1.7.10. 


Lemma 1.11.4 The distribution of a generator Z € S of a seminorm 
l-l € Ks is uniquely determined, i.e., if Z, Z) are generators of 
this seminorm in Kg, which both realize in S, then we have for any 
Borel subset B C S 


P (z® € B) =P (z® € B) l 


By the preceding result, we can identify the set Ks of seminorms generated 
by means of S with the set of probability measures on S. 


Proof. Put E := [0,00)“\ {0} c R? and AB := {\b: b € B} for an arbitrary 
set B C R? and À > 0. Set 


V(E\([0, 00) + S)) := 0 
and, for all Borel subsets B of S and A > 0, 
1 
v((A, 00) - B) := PZ € B). (1.32) 


One readily finds that this defines a measure v on the Borel o-field of 
E, which is uniquely determined by the distribution of Z; see the proof of 
Lemma 1.7.8. Repeating the arguments in the proof of Lemma 1.7.9, we obtain 
for x = (x1,..., £a) > 0 E RI 


v (oa) = 2 (mas, (372) ) = 


The measure v is by equation (1.33) uniquely determined by the seminorm 
Il-\|5- Let Z® and Z°) be two generators of ||-||,¢, which both realize in S. 
Then, by equation (1.32), we obtain for an arbitrary Borel subset B of S 


- (1.33) 


S 


P (z® € B) = v((1,%) - B) =P (z E B) 


which is the assertion. 
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CONVEX HULL AND EXTREMAL SET 


In what follows, each vector space V is defined over R, i.e., if 71, £2 € V, then 
AyH1 + AQgx2 € V for à1, A2 E R. 

A subset K of V is convex if Aw, + (1 — A)ae € K for each z1, £2 E K 
and each A € [0,1]. For the sake of completeness, we establish the following 
well-known characterization of convexity. 


Lemma 1.11.5 The set K C V is convex iff 


n 
So ai € K, WE nies ee K, 
i=l 


for any n EN and \y,...,An € [0,1] with Ai = 1. 


Proof. Clearly, we only have to prove the implication “=.” It can be seen by 
induction as follows. Suppose equation (1.34) is true for n > 2. It is true for 
n = 2 by the convexity of K. Choose 41,...,An+1 € [0,1] with pea X= 1 


and @1,...,@n41 E€ K. We can assume wlog that 5>;"., à; > 0. Then, we 
obtain 

n+1 n 

5 NEE = Anti Ln41 + > Aiti 

{=l i=l 


= Anii1 £ 


3 
fi 
a 


Ài 
= Nf Ln 1+ (L= Ant) DS 8 EK 


by induction. 


For any subset X of a vector space V 


conv(X) := fæ E V : there exist n € N, £1,..., £n E X and à1,..., An € [0,1] 
with 5 A; = 1, such that 2 = 5 dia} 
i=1 i=1 
is the conver hull of X. For any convex K C V 


ex(K) := {æ € K : if x = Azı +(1—A)ae for some 21, ro € K 
and some À € [0,1], then #1 = £2} 
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Subset of R? Convex Hull 
{x1,..., aa} convex polyhedron with at most d corners 
arbitrary convex set the set itself 


the grid Z? of integers | R? 
Table 1.1: Examples of subsets of R? and their convex hulls 


is the set of extremal points of K or the extremal set of K. Here is a list of 
examples in V = R? (Tables 1.1 and 1.2). 
For every convex subset K C V, we have 


conv(ex(K)) C conv(K) = K. (1.35) 
Convex Set K Extremal Set ex(K) 
convex polygon corner points 
closed unit ball unit sphere 


open unit ball o 
{(z,y) ER”: y> 0} |Ø 
[0, 00)? { 


(0, 0)} 


Table 1.2: Examples of convex sets in R? and their extremal sets. 


While the equality conv(K) = K is an immediate consequence of the 
convexity of K, the inclusion in (1.35) follows from the fact that ex(K) C K. 

The Krein-Milman theorem for finite-dimensional normed vector spaces 
states in particular that the reverse inclusion in (1.35) is true as well if K is 
convex and compact. Here compactness is meant with respect to the usual 
Euclidean topology, generated by an arbitrary norm on R?. 


Theorem 1.11.6 (Krein—Milman, finite dimensions) Let K C 
R? be conver and compact. Then, ex(K) is compact as well, and 


K = conv(ex(K)). 


The condition that K is compact cannot be dropped. For example, let 
K #0 be an open ball in R¢. Then, ex(K) = Ø = conv(ex(K)). For a proof 
of Theorem 1.11.6, we refer to Phelps (2001, Section 1). 


BARYCENTRIC COORDINATES 


Let K C R? be a convex and compact set, whose extremal set ex(K) = 
{£1,... £n} is a set of n distinct vectors in R. For any x € K there exists 
according to Theorem 1.11.6 a vector w = (w1,..., Wn) = w(x) of weights 
Ww1,..., Wn € [0,1] with 0", wi = 1, such that 
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n 
i=1 


The vector (w1,..., Wn) is called a vector of generalized barycentric coordi- 
nates of x. 

The vector w(x) is, in general, not uniquely determined. Take, for instance, 
the unit square K = [0,1] x [0,1] in R?. Its extremal set is the set of its four 
corners {(0,0), (1,0), (1,1), (0,1)} = ex(K). The center (1/2,1/2) of K can 
be represented in two different ways: 


1 


(5. 5) = 5 (0:0) +5(41)= 5 (0. 1) + 5 (1,0). 


In this example, we have two vectors of generalized barycentric coordinates 
of (1/2, 1/2): (1/2,0,1/2,0) and (0, 1/2, 0, 1/2). 

Each vector of barycentric coordinates w = w(x) = (wi,...,Wn) for a 
fixed x € K with corresponding extremal points £1, ...,&n can be interpreted 
as a discrete probability measure Qw on the set ex(K) of the extremal points 
of K: 


n 
Qw(B) = X wea (B), B C ex(K), 
i=1 
where ¢€,(-) is the Dirac measure or point measure with mass one at z, i.e., 


€z(B) = 1 if z € B and zero elsewhere. 
As a consequence, we can write for any linear affine functional f : K > R 


f(a) = f “nf 2) Ould 


recall that a is kept fixed. This representation can easily be seen as follows. 
Each linear affine functional f : K — R can be written as f(-) = &(-) + b, 
where £ is a linear function and b € R is a fixed real number. We therefore 
obtain 


= Sowa) +b 
= dante + 6) 
= Duse) 


= f F(w") Qu(da’). (1.36) 
ex(K) 
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Definition 1.11.7 Let K be a convex and compact subset of Rê. If, 
for any x € K, the weight vector w(x) = (wi,...,Wn) of generalized 
barycentric coordinates with corresponding extremal points £1,..., £n 


is uniquely determined, then K is called a simplex, and w(x) is the 
vector of barycentric coordinates. 


Choose, for example, v1,...,U, € R? such that the vectors v2 — 
V1,---,;Un — vı are linearly independent. Then, their convex hull K := 
conv({v1,...,Un}) = {æ = J; wivi : wi > 0, X; wi = 1} is convex and 
compact. It is easy to see that its extremal points are v1,...,Un, and that 
the vector w(x) of weights of x € K is uniquely determined by the in- 
dependence of v2 — v1,...,Un — vı. In other words, K is a simplex. The 
set K = {£x >0€R*: |æ]; <1} is an example, being the convex hull of 
{0,e1,..., ea} C RI. 


LOCALLY CONVEX SPACES 


The general Krein-Milman theorem is formulated for a locally convex vector 
space V of arbitrary dimension. This is a vector space equipped with a topol- 
ogy, such that for any neighborhood U of each vector æ € V, there is a convex 
neighborhood Ue C U of a. 

Local convexity of a vector space V can be characterized in terms of semi- 
norms ||: ||. For a proof of the following characterization, we refer to Jarchow 
(1981, Section 7.5). 


Lemma 1.11.8 An arbitrary vector space V, equipped with a topology, 


is locally convex iff there is a family of seminorms 4||-||,;: 7 Zr, 


indexed by some index set Z, such that for each x € V and an arbitrary 
sequence £k E V, k EN, we have 


Dye Spare eV ee || ary, = zll >k 0. 


If the topology of V is generated by a norm ||-|| on V, then we can obviously 
choose Z = {1} with |]-||,, = ||-||. As a consequence we obtain that each 
normed vector space is locally convex. However, we do not need Lemma 1.11.8 
to see this. 


Example 1.11.9 The set of denumerable sequences of real numbers 
Ve ae (t t3): x, €R,ie N} 


is a vector space, with addition « + y and multiplication cæ, c € R, 
meant componentwise. By defining ||z||,; := |x:|, where x; is the i- 


th component of x € V, i € N, we obtain a family {Ils :i€ N} 
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of seminorms on V, indexed by Z = N. We define convergence of a 
sequence £k E€ V, k € N, tox € V by 


tk >k t: 4 VIEN: ||£k = zll Spee, Ob 


This yields the topology of element-wise convergence on V. If we put, for 
example, ex := (0,...,0,1,0,...) € V with 1 being the k-th component, 
k € N, then we obviously obtain ek 4440 0 = (0,0,...) € V. We even 
obtain kek >k 0 = (0,0,...). Equipped with this topology, V is a 
locally convex vector space according to Lemma 1.11.8. 


Example 1.11.10 The space V4 := { T R R} of real valued func- 
tions on R¢ is a vector space, equipped with the usual componentwise 
operations. By defining for æ € Rt 


fle =lf(@)l, fE Va, 


lint Ce R} of seminorms on Vg, indexed by 


we obtain a family { 
T=R4 


We define convergence of a sequence fk, k € N, to f in V by 
fe eco f e VEER A E — S Gece 0, 


which generates the topology of pointwise convergence on V. Equipped 
with this topology, Va, is according to Lemma 1.11.8, a locally convex 
vector space. 

Note that the set of seminorms Kg, derived from an angular set 
S c [0, 00) as in (1.31), is a subset of Va. 


KREIN—-MILMAN THEOREM IN ARBITRARY DIMENSIONS 


We are now ready to state the Krein-Milman theorem for a general locally 
convex space V, not necessarily a finite dimensional one. By A we denote the 
topological closure of a subset A C V, i.e., A is the intersection of all closed 
subsets of V that contain the set A. For a proof of the following result, we 
refer to Jarchow (1981, Section 7.5). 


Theorem 1.11.11 (Krein—Milman, Arbitrary Dimensions) Let 
K C V be a compact and convex subset of a locally convex real vector 
space V. Then, we have 


K = conv(ex(K)). 


Note that, because of the closure in the previous result, it is not guaranteed 
that every æ € K is the convex combination of extremal elements of K. 
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CHOQUET—BISHOP—DE LEEUW THEOREM AND BAUER 
SIMPLEX 


The following result generalizes representation (1.36) of a linear affine func- 
tional f(a) in terms of barycentric coordinates of æ in a finite dimensional 
vector space to arbitrary dimension. For a proof, we refer to Phelps (2001, 
Section 4) or Lax (2002, Section 13.4). 


Theorem 1.11.12 (Choquet—Bishop—de Leeuw) Let KCV be a 
compact and convex subset of a locally convex real vector space V. For 
every x € K, there exists a probability measure Qx on ex(K), equipped 
with the induced Borel o-field of V, such that for every linear affine and 


continuous functional f : K > R, 


f(e) = [ F(v) Qo(de). 


x(K) 


Recall that according to the Krein-Milman theorem 1.11.11 in arbitrary 
dimensions, the set K in the previous result equals conv(ex(A’)). A functional 
f : K —> R is in this setup, therefore, automatically defined on ex(K) C 
conv(ex(K)). We are now ready to define a Bauer simplex. 


Definition 1.11.13 Let V be a locally convex vector space. A convex 
and compact subset K C V is a Bauer simplex if ex(K) = ex(K) and 


if, for every « € K, the probability measure Q, in Theorem 1.11.12 is 
uniquely determined. 


If we put V = R? and equip it with an arbitrary norm |]-||, then each 
compact and convex subset K C V is a Bauer simplex iff it is a simplex in 
the sense of Definition 1.11.7. This is a consequence of Theorem 1.11.6 and 
the fact that (R4, ||-||) is locally convex. 


BACK TO SEMINORMS FROM AN ANGULAR SET 


In what follows, we apply the preceding results to the set Ks of seminorms, 
defined by generators in an angular set in [0,00)¢ as in (1.31). 

We consider Kg as a subset of the vector space Vz of functions from Rê to 
R, equipped with the topology of pointwise convergence as in Example 1.11.10. 
According to Lemma 1.11.8, Vg is a locally convex vector space. 

In Lemma 1.11.3, we established the fact that Ks is convex. Next, we 
show that it is sequentially compact, i.e., every sequence ||-||,,,, n € N, of 
seminorms in Ks contains a subsequence ||-|| 5 min) n € N, which converges 
to a seminorm ||-||, in Ks: 


n)? 
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II-llsm(n) neo llls- (1.37) 


We have equipped the vector space Vq with the topology of pointwise conver- 
gence, and thus, the convergence in (1.37) is meant componentwise, i.e., 


i d 
Jim |l2Ils.miny r> lels, BER®, 


But the fact that Kg is, in this sense, sequentially compact is immediately 
seen by repeating the arguments in the proof of Corollary 1.8.5, i.e., we have 
the following result: 


Lemma 1.11.14 The set Kg is sequentially compact with respect to 
the topology of pointwise convergence. 


METRIZING THE SET OF SEMINORMS 


Next, we prove that the topology of pointwise convergence on the set Ks can 
be metrized. It is well known that sequential compactness is equivalent to 
compactness in a metric space; thus, we obtain as a consequence that the set 
Kg is compact as well. 

Let {2 a),...} = Qf be an enumeration of the countable set Q? of 
points in R? with rational components. It is easily seen that, with llls 
[ilsa € Ks, 


1 [le®ls,- lle" 


d(Illsaslhlsa) = oe 


ten“ 1+ Loe = oe 


I sal 


(1.38) 


defines a metric on Kg. Pointwise convergence of seminorms in Kg can be 
metrized by this metric d(-,-). This is the content of our next result. 


Lemma 1.11.15 For seminorms ||-||s n: n E NU {0}, in Ks, we have 


4(I-llansllllso) aae 0 > VEERM: [lal] 5, treo leloa 


Proof. The implication “=” is easily seen and left to the reader. The implica- 
tion “=” can be seen as follows. Note that d (i ‘ll eas l-lls,o) —n—>o 0 implies 
lullsn >n=æ lulls, for any y € Q2. The angular set S is a compact subset 
of [0,00)¢, and thus, it is in particular bounded, i.e., there exists a number 


c > 0 such that S$ C [0,c]“. As a consequence, each seminorm ||-||¢ in Ks with 
corresponding generator Z = (Z1,...,Za) € S satisfies 


Ille = ;J<e. 
Alls = £ (nas, Z) <c 
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For every x € R? and for every € > 0 there exists y € Q? with ||a — yll < £- 
As a consequence we obtain by the triangle inequality for any n € N 


lælsn < llyllsn + le -= Yllsn < llyllsn t E6, 


lællsn Z WYllsn— lle =- Ylsn 2 llylls,n — &% 


as well as 
lulls ~ €¢ < llællso < Ilyllso + Ec- 
But this yields 


limsup ||ællsn = lll so < 2ec. 
n> oo 
Since £ > 0 was arbitrary, this implies 


lim sup lel sin = lælis,o| =0 
noo 


and, thus, the assertion. o 


The preceding considerations imply that the topology of pointwise conver- 
gence on Ks can be metrized: 


llls noe llls > Yz ER: |lzllsn Pn lællso 


<> d(lilsn:llls,o) >> 0. (1.39) 


Because sequential compactness is equivalent to compactness in a metric 
space, Lemma 1.11.14 has the following consequence. 


Lemma 1.11.16 The set Ks of seminorms, defined by the angular set 
S C [0,00)4, is a compact subset of Va = { f : R? > R}, equipped with 


the topology of weak convergence. 


The set Ks of seminorms is, therefore, a convex and compact subset of 
the locally convex vector space Vz. We can now apply the Choquet-Bishop- 
de Leeuw theorem 1.11.12. The following result identifies the extremal set 
ex(K s). 


Theorem 1.11.17 A seminorm in Ks is extremal iff it has a constant 
generator Z =z E€ S. 


Proof. We first show that a seminorm ||-||, with a constant generator Z = 
z = (21,...,Zd) € S is an extremal element. Suppose there are seminorms 
lls lls. in Ks with generators ZY, Z® € S, and A € (0,1) such that 


læls =Alleliga +Q—A)llalig2, z ER. 
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This is equivalent to 


nx (esla) =E ( max (lai Z)), zer’, 
where € € {1,2} is an rv with P(E = 1) = \ = 1 — P(€ = 2), which is inde- 
pendent of Z™® and Z(? (see the proof of Proposition 1.4.1). Note that Z(® 
realizes in S, and thus, it is a generator of ||-||¢ as well. But the distribution 
of a generator on S$ is uniquely determined by Lemma 1.11.4, and thus, we 
obtain Z) = z a.s. This is equivalent to 


ZY =z = Z) a.S., 


and thus, a seminorm ||-||, with a constant generator in S is extremal. 

Next, we show that each extremal seminorm in Ks has a constant gen- 
erator Z = z € S. We establish this fact by showing that each seminorm in 
Ks, whose generator Z € S satisfies P(Z = z) < 1 for each z € S, is not 
extremal. Let Z be such a generator. Then, we can find two disjoint Borel 
subsets A and B of S with AU B = S and P(Z € A) > 0, P(Z € B) > 0. 
Otherwise, we would readily derive a contradiction. 

Let the rv Z(® follow the elementary conditional distribution P(Z € - | 
Z € A), and let the rv Z (2) follow the elementary conditional distribution 
P(Z € -| Z € B). Note that Z, Z©) € § are both generators of seminorms 
in Kg. For æ € Rt, we have 


E ( max, (le Z)) 


= PZ e AE ( (z121 Z€ A) 


max 
1<j<d 


+P(Z € B)E ( (1) | Ze B) 


max 
1<j<d 


7 pt 7 1) 7) 
=\E (max, (les Z; )) +(1-A)E ES (lz; Z; )) ; 


with À := P(Z € A) € (0,1). Note that the distributions of Z® and Z are 
different. The seminorm generated by Z is, therefore, not extremal. 


INTRODUCING A HOMEOMORPHISM 


The functional T : S + ex(Ks), which maps each z € S onto the seminorm 


llls, € Ks with constant generator Z = z = (z1,...,2a) € S, i.e., 
ells. 2 San ena) T = (£1,..., £a) Ee Rf, 


is, obviously, one-to-one. We also have for a sequence Zn € S, n € N, 
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Zn —n00 2 <== llls z, > noo llls,z> 


i.e., the functional T, as well as its inverse functional, is continuous. The func- 
tional T is, therefore, a homeomorphism. It maps the Euclidean topology on 
S one-to-one onto the topology defined on ex(Ks), which is the topology of 
pointwise convergence. It can be metrized as in (1.39). We state this relation- 
ship explicitly. 


Lemma 1.11.18 The angular set S C [0,00)4 is homeomorphic to 


ex(Ks) with the homeomorphism T. 


As S$ is a compact subset of R?, the set T(S) = ex(Ks) is compact as well. 
In particular, it is closed, i.e., 


ex(Kg) = ex(Ks). (1.40) 


The next result was established by Ressel (2013, Theorem 1) for the com- 
plete angular set S = {a € [0,1]*: ||a||,,} = 1. Its extension to an arbitrary 
angular set was proved by Fuller (2016). 


Theorem 1.11.19 For an angular set S, the set Kg is a Bauer sim- 
plex. The extremal elements are the seminorms ||-||g, with a constant 


generator Z =z E€ S. 


Proof. The set Ks is, according to Lemmas 1.11.3 and 1.11.16, a convex and 
compact subset of Vz, which is a locally convex vector space, as shown in 
Example 1.11.10. According to equation (1.40), the set ex(Kgs) is closed. In 
order to prove that Kg is a Bauer simplex, it remains to show that, for every 
element ||-||¢ € Ks, the probability measure Qq., on ex(/¢s), defined in the 
Choquet—Bishop—de Leeuw theorem 1.11.12, is uniquely determined. 

Choose ||-||; € Ks and let Qq. be a probability measure on ex(Ks) = 
ex(Ks) that satisfies 


Fs) = fF (Ilse) ris (Alls) 


for every linear affine and continuous functional f : Ks > R. 

Note that f (llls) := læl|çş with x € R? kept fixed defines a linear and 
continuous functional on Kg; see Lemma 1.11.3 and Example 1.11.10. Thus, 
for every x € R4, we obtain the representation 


zls = alls Oy. (4li-llge) - 1.41 
lels = f lels Qn (Alls) (1.41) 
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According to Lemma 1.11.18, we can identify ex(Ks) with S and their topolo- 
gies as well. As a consequence, the probability measure Qll- on the Borel 
o-field of ex(Ks) can be identified with a probability measure o on the Borel 
o-field of S. Equation (1.41), therefore, becomes 


Isls = f lælls = (de) 
S 
= f max (|a;| 2) (dz) 
= E ( mex (031 Z5)) (1.42) 
SJ. 
where Z = (Z1,...,Za) is an rv in S with distribution o(-). As Z is the 


generator of the seminorm ||-||, by (1.42), its distribution is, according to 
Lemma 1.11.4, uniquely determined. This completes the proof. 


According to the preceding result, every seminorm ||-||, € Ks; thus, the 
uniquely determined distribution of its generator in S can be identified with a 
probability distribution on the extremal set ex(K‘g), i.e., with its barycentric 
coordinates, roughly. 

By Corollary 1.7.2 we know that, for any D-norm ||-||,, on R¢, there exists 
a generator Z = (Z1,...,Zq) with the property ||Z||, = D Zi = d. If we 
put Sa := {x > 0 ER? : ||æl|ı =d}, then Sq is a complete angular set as in 
Definition 1.7.12, and the family Kp of D-norms on Rĉ is a subset of the 
Bauer simplex Ks of seminorms generated by the set S. 

According to Corollary 1.8.5, we know that the pointwise limit of a se- 
quence of D-norms is again a D-norm, and thus, Kp is a closed subset of Kg. 
As Ks is metrizable according to Lemma 1.11.15 and compact according to 
Lemma 1.11.16, Kp is compact as well. According to Proposition 1.4.1, it is 
also convex. 


A DENSE SUBSET OF THE SET OF D-NORMS 


A subset Km of the set Kp of all D-norms on R? is dense in Kp, if for each 
IIb € Kp there exists a sequence ||-||p p; n E€ N, of D-norms in Ky that 
converges to ||:||p pointwise, i.e., 


i d 
Jim lello» = llelo, eRe 


Our final result in this section provides a dense subset of Kp. 


Proposition 1.11.20 The set of D-norms, whose generators follow a 
discrete distribution on Sa = {x > 0 E€ R°: ||x||, =d} is dense in the 


set of D-norms. 
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Proof. The extremal elements in Kg are, according to Theorem 1.11.17, 
those seminorms that have a constant generator. The Krein—Milman theo- 
rem 1.11.11 implies that, for any D-norm ||:||p on R, there exists a sequence 
of convex combinations 


m(n) 


Ills = >, win llllsin? TEN, 
of seminorms ||-||,,,, € ex(Ks) with constant generator Zin := Zin € S, 
1<i<m(n), n €N, such that 


lællsn >n>o llælp, æ ERY. (1.43) 


The weights w;.n > 0,1 < i < m(n), which add up to one, i.e. EZ w Win = 1, 
together with the vectors 21,n,-..;2m(n),n define a discrete probability mea- 
sure on S' for each n € N via 


):= 5 Wi,nEzin(*) 
isl 


Let Z") :— (a, .. ey € S be an rv, which follows this discrete 


probability measure P, with support zinia Sean be n € N. The rv 
Z(™) generates the seminorm lllgn, since for every x € R? with zin = 
(Zins ae , Zign,d)s we have 


| (rn) _ (n) _ 
E CAGE )) = 2 max (lezl zinj) P (Z = zin) 
m(n) 
= 5 lælls in Win 
i=1 
= Jiælls,n 


However, the seminorm ||-||, „ is in general not a D-norm, as the condition 
on its generator E (z) = 1, 1 < j < d, is not generally satisfied. From 
(1.43), however, we conclude 


p” =E Ge =lejllg, maa lelo =i; F=1,--,4 (1-44) 


where e; = (0,...,0,1,0,...,0) € R? denotes the j-th unit vector in R¢. 
Note that 
d 


D = Da es a 
j=l 
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thus, Bn := maxi<;<a ae > 1 as well as Bn noo 1 by (1.44). We have 
gm 


d 
1 Bn- £ 1  dB,-d 
—+)5 a ee a 
By = dB, By dB, 


and each summand on the left-hand side of the above equation is non-negative. 
Let 6; be the probability measure on Sg that puts mass one on the vector 


dej, 1 < j < d. Then, 


d (n) 
B, — 6: 
sy. fen 


(n) 
1 n — Ê}; 
|Qn(B) a Pa (B)| <1- B +> =p —> n= oo 0 (1.45) 


for any Borel subset B of Rt. Moreover, for each j € {1,...,d} and n € N, 
we have (n) 
1 a(n), Bn — Bi" 
Qh = — 6! d=1. 
ie "i a e Bn f; H dBn 


Let the rv Z™ = (2, shag Zz.) € Sq follow this distribution Qn. We 
have 


E (2) -f zj Qn(dx) = 1 


for each j € {1,...,d}, and therefore, Z™) generates a D-norm, say, Ilon- 
The convergence in (1.45) together with Lemma 1.2.2 imply for x € R? 


— | am) 
elu = (max, (112) 


=, 1- P(x] 2 <t, 1<j<a) dt 
0 
dmax}<j<a|®j 2 
=| 1 P (|x| 2 <t,1<5 <a) at 
0 
dmax}<j<a|®j 
-f 1- P (|x| Zh? <t, 1<j <d) dt + off) 
0 


= E ( max (le; zi) + o(1) 


1<j<d 


= |lælls n + OC) nse [lll » 


which completes the proof. 
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1.12 D-Norms from a Stochastic Geometry Perspective 


Each D-norm on R? can be characterized by a particular convex and compact 
subset of [0, 00) called maz-zonoid, which was observed by Molchanov (2008). 
This characterization, which is essentially Corollary 1.12.17 below, is achieved 
within the framework of stochastic geometry. We list only a few auxiliary re- 
sults and tools, which we need for the derivation of this characterization. As 
an application, we can considerably extend the well-known Hoélder’s inequal- 
ity to D-norms; see Theorems 1.12.22 and 1.12.24. For a thorough study of 
stochastic geometry, we refer the reader to the book by Molchanov (2005) and 
the literature cited therein. 


ORTHOGONAL PROJECTION ONTO A LINE 


For arbitrary vectors x = (£1,..., £a) and y = (y1,..., ya) in R%, put 
d 
(x, y) = 5 TiYi E R, 
i=1 


which is the usual scalar product or inner product on R°. This is obviously a 
bilinear map: 


(say + tx2, y) T s(£1, Y) Tv t(x2, Y), 
(x, syYyı + ty2) = s(x, yı) Tv U(x, y2), s,t ER. 


Note that ij 
d 
va) = bs =) = lll, 


is the Euclidean norm on Rê. 

Fix x € R? with unit length ||æ||, = 1. Mapping y € R? onto the linear 
combination (æ, y) € R has the following geometric interpretation, (see Fig- 
ure 1.1 for illustration). First, recall that y sits by definition at a right angle 
to x iff (x,y) = 0. The vector x defines the line Lẹ = {sæ : s € R} in R°. 
Projecting the vector y orthogonally onto this line means that the projection 
sox sits at a right angle to the vector y — soa, i.e., (£x, Y — soz) = 0 or, 
equivalently, 


(x,y) = (a, 89%) = so(æ, £) = so ||x||2 = so- 


We see that the inner product (x,y) of x and y is just the coordinate so 

of the orthogonal projection of y onto the line Læ. If x has arbitrary length 
2 

llæll > 0, then (a, y) = so ||x||5- 
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Fig. 1.1: Orthogonal projection of y onto the line La. 


INTRODUCING THE SUPPORT FUNCTION 
Let L C R? be a non-empty compact set. For # = (11,...,2@) E R, put 
d 
h(L, æ) := sup {(y,@) : veL) =e] un: (y1,--- Ya) er), 
i=l 


which defines the support function h(L,-) of L. The support function is one 
of the most central basic concepts in convex geometry. 

A convex and compact set L C R? is uniquely determined by its support 
function h(L,-). This is a consequence of the next result. Put, for æ € R, 


Hr(æ) := {y ER? : (y,x) < h(L,2)}, 


which is the half space of Rt that corresponds to L and x. 


Lemma 1.12.1 Let ý A LC [0,00)4 be compact and convex. Then, 


L= (Hae) 


weRe 


Proof. Each y € L satisfies (y, £) < h(L, x), and thus, L C Hz(a) for each 
x € R4, i.e., 


Lc {) Hra). 


xER¢ 


Choose z € R¢, z ¢ L. It is well known that z and L can be separated in 
the following way: we can find x € Rt, æ Æ 0, such that, for all y € L, 


(y, xz) < (z, £). 


This is the hyperplane separation theorem; see, for example, Rockafellar (1970, 
Corollary 11.4.2.). As g(-) := (+, Œ) is a continuous function on R? and L C R 
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is compact, the supremum sup {g(y): y € L} is attained, i.e., there exists 
yo € L with 


glyo) = sup {g(y) : y € L} = A(L, a) < (z, 2). 


This shows that z ¢ Hz(æ), which implies the assertion. 


The following consequence of the Lemma 1.12.1 is obvious. A non-empty 
convex and compact set in [0,00)? is uniquely determined by its support 
function. 


Corollary 1.12.2 Let Lı Æ Lə be two non-empty convex and compact 
sets in [0,00)¢. Then, h(L1,-) £ h(La,-), i.e., there exists x € R? with 


h(L, x) A h(Lo, x) . 


EACH SUPPORT FUNCTION PROVIDES A SEMINORM 


Lemma 1.12.3 Let K be a non-empty compact and convex subset of 
[0, 00). Put |æ| = (|æz1|,.-., |£al) for £ = (ai,-.., £a) E RÊ. Then, 


d 


lælls := h(K, |æ) = sup {ou vi]: y = (Y1,---, Ya) € «| (1.46) 


i=l 


defines a seminorm as in Definition 1.11.1. This seminorm is mono- 
tone, i.e., 0 < a < x) implies |e |], << jx? 


‘IIs: 


Proof. We obviously have ||x||, > 0, IlO] = 0 as well as ||Aæ]| s = |A| llælls 
for A € R. It remains to show that ||:||¢ satisfies the triangle inequality (1.3). 
Choose x), 2) € Ri and y € K; note that y > 0 € Rt. The ordinary 
triangle inequality for the absolute value of a real number implies 


(u,b + 2) = Y (ule + 2°) 
i=l 


<J (vile? + vile?) 
i=1 


= (y, a |) + (y, |e) 


and, thus, the triangle inequality 


ere = | lal ls 
S S S 


The monotonicity of ||-||¢ is obvious; recall that K C [0, o0). 
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A SUPPORT FUNCTION CAN PROVIDE A NORM 


The preceding result clearly raises the question: When is the seminorm ||-||g 
a norm? The answer is well known; see Rockafellar (1970, Theorem 15.2). 
Adapting this to our purposes, we explicitly state the characterization. 


Lemma 1.12.4 Let Ø # K c [0,00)4 be compact and conver. The 


seminorm defined in (1.46) defines a norm iff K N (0,00) £0. 


Proof. Suppose first that K N (0, o0)? # Ø, i.e., there exists y = (y1,-.-, Ya) € 
K with y; > 0,1 < i< d. Suppose ||æ||¢ = 0. We have to show that æ = 0 € 
Rt. But this follows immediately from the inequality 


d 
0 = |jælls > X- yi lz:]| 2 0, 


i=l 


thus, ||z||, =0, or æ = 0 € R. 
Suppose next that ||-||,¢ is a norm on R4. We have to show KN (0, co)4 Æ 0. 


As ||-||g is a norm, we have |le;||, > 0 for each j = 1,...,d, i.e., there exists 
yj E K, whose j-th component is strictly positive. Since K is convex, the 
vector y := Dai y;/d is in K as well, and it is in (0,00)¢. o 


Let ||-|| be an arbitrary radially symmetric norm on R4, i.e., changing the 
sign of any components of x € R? does not alter the value of ||x||. In this 
case, ||-|| is determined by its values on [0,00)¢. One may conjecture that ||-|| 
might equal h(K,-), where K = {æ > 0 € R° : |x|] < 1}. The following result 
shows in particular that this conjecture is not true. 


Lemma 1.12.5 Let p,q € [1,00] with 


sa i pe (1,00), 
q:= 41, if p = œ, 


oO, ep —ele 


Then, for the family of logistic norms on Rĉ as in Proposition 1.2.1, we 
have 


d 
|||, = A(Ka, |x|) = sup 2 yi lzi]: y 2 0 €R, |lyllg < 7 , (1.47) 


i=l 


ie., the conver and compact set that generates the norm ||-||, as in 


equation (1.46) is Kg = {y >0ER?: lull < i}. 
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Proof. In what follows, we assume wlog x = (z1,..., £4) > 0 € RI, x £ 0. 
With p = 1 and q = œ, we obtain for every y > 0 € R$, |lyll o <1 


d d 
Sank elles 
i=l j=l 


and equality holds for y = (1,...,1) € Rt. This proves (1.47) for the combi- 
nation p = 1, q = œ 
For p = œ and q = 1, we obtain for every y > 0 € R? with ||y||, < 1 


d d 
X mimi < $ lell ¥ = æl lulls < lello » 
i=1 


i=1 


and equality holds for the choice y = e;*, where i* € {1,...,d} is that index 


with xj = max(z1,..., £a) = ||x||,,. This proves (1.47) for the combination 
p=œ,q=l1. 
Finally, we consider p,q € (1,00) with p~! +q7t = 1. We obtain with 
w* := (z1,...,2a)/|lx\l, 
d 
To :y>0ERf, Illa < r} 
i=1 


= lel, oof Tel: y> 0ER", llyll, <1) 


d 
* d 
= En : y > 0ER, lyll, < i} 


i=1 


Hélder’s inequality implies for y > 0 € R? with ||y||, < 1 


d 
dizi yi < lle" lly lyla = lilla < 1, 


i=l 


therefore, it is sufficient to find y € K4 such that 5i gyi =l. 
We have equality in Hölder’s inequality if 
at =y, 1<i<d. 


7 


Therefore, put 


Then, we obtain 


d 
do riu = lle" ll, lul = Myla 
i=1 
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with 
d 1/q 
lyla = (>. z) == 
i=1 


which completes the proof. 


THE SYMMETRIC EXTENSION 
Let K C [0, 00)? be a convex and compact set. By 
L(K) := {y E R°: |y| eK}, 


we denote the symmetric extension of K; recall that the absolute value of 
a vector |y| is taken componentwise. The symmetric set L(K) is compact as 
well, but not necessarily convex; just set K := {A(1,...,1): A € [0,1]}, which 
is a line in [0,1]¢. Its symmetric extension L(K) is, for d = 2, a cross and, 
thus, not a convex set. 

The support function that corresponds to L(K) satisfies, for æ € Rĉ, 


d 
h(L(K), £) = sup 2 yt; : |y] € x) 


i=1 


d 
= wo fo val Ize]: ly] € x) 


i=1 


= h(K, |a). 


THE NORM INDUCED BY THE SYMMETRIC EXTENSION 


Let K C [0, 00)? be compact and convex with KN (0, o0) 4 0. Then, according 
to Lemma 1.12.4, 


h(L(K), x) = h(K, |x|) = lælls, x eR, (1.48) 


defines a norm on R¢, which we denote by ||-||x in what follows. We say that 
it is generated by the set K. 

Put Ke := {A(1,...,1): e< A < 1} C Rf, which is, for every c € [0, 1), a 
line in [0,1]. For æ € R? we have 


d d 
h( Ke, |x|) = sup 1 yi lzi]: y € x} = X Je: = elles 
i=1 i=1 
As a consequence, we obtain for each c € [0, 1) 


h(L(K,), x) = ||x||, , x € R¢. 


84 1 D-Norms 


This shows that a convex and compact subset of [0,00)?, whose support func- 
tion generates a norm, is not uniquely determined by this norm. Note that 
L(Ke) = {y E R? : |y| =AQ,...,1), c <À < 1} is not a convex set if d > 2 
for any c € [0, 1). 

If we put, however, K := [0,1]%, then K is convex and compact, K N 
(0,00)? # 0, and L(K) = [—1,1]¢ is convex as well. The norm ||-||,, that is 
generated by K is again ||-||,, but now K is uniquely determined: it is the only 
convex and compact subset of [0, 00)“, K N (0,00)? 4 0, generating ||-||, such 
that L(K) is convex. This is a consequence of our preceding considerations, 
summarized in the next result. 


Lemma 1.12.6 Let K C [0,0o)? be a conver and compact set with 
KN (0,00)4 £0. If L(K) is a conver set, then K is uniquely determined 


by the generated norm ||-|| x. 


Proof. For x € R?, we have 
A(L(K), x) = ||ællz - 


This equation identifies the set L(K) according to Corollary 1.12.2. But 
L(K) = {y € R¢: |y| € K} identifies the set K. 


Cross-POLYTOPES 


For z = (z1,...,2a) > 0 € RY, put 


Az := conv ({0, z1e1,..., zaea}) 
d d 
= {aa Mauzo on eth, (1.49) 
i=l i=l 
which is the convex hull of the vectors 0, z1e1,...,2aea € R?. The set A, is 


a compact and convex set in [0, o0). It is called a cross-polytope. 


Lemma 1.12.7 The symmetric extension L(Az) of a cross-polytope 


Az with z > 0 € R? is conver. 


Proof. Choose x,y € L(Az), i.e., 


d d 
L= > ÀiZiei, Y= X KiZiei, 
j=l i=l 


with 77, [Ai] < 1, 04, |si] < 1. We obtain for V € [0,1] 
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d 
dx +(1—V)y=S_(VA + (1 — D) Ki) ze: 


i=1 


with 


Sia: +(1-9 Jnl <0 Dal +( a- Sohal <i 


w=1 i=l 


i.e., |Va+(1—V)y| € Az, which completes the proof. 


INTRODUCING MAX-ZONOIDS 


Let Z = (Z1,..., Za) > 0 € R? be an rv with the property E(Z;) € (0,00), 
1 < i< d. Then, 


roe = d 
jelz = E ( max (lel Z9); æ= (ays yaa) ER 


defines a norm on Rf; see the proof of Lemma 1.1.3. It is a D-norm iff E(Z;) = 
1 fori=1,...,d. 


Definition 1.12.8 (Max-Zonoids) Let K C [0,00)4 be a compact 
and convex set with KN (0,00)? 4 Ø, whose symmetric extension L(K) 
is convex as well. If the norm ||-||,, on Rf, which is generated by K 
satisfies 


Ile = Illz 
for some Z = (Z1, ..., Za) > 0 € R? with E(Z;) € (0,00), 1 <i <d, 
then K is called a maz-zonoid. 


A max-zonoid generates a D-norm ||-||,, if E(Z:)= 1,1 <i < d. In this 
case, we call K a D-maz-zonoid. It is also known as a dependency set. 


Remark 1.12.9 A max-zonoid K C [0,00)¢ is uniquely determined by 
the norm ||-||,z. This is just a reformulation of Lemma 1.12.6. 


Example 1.12.10 Each logistic norm ||-||,,, p € [1,00], is, according 


to Proposition 1.2.1, a D-norm. Lemma 1.12.5 shows that each ||-||,, is 
generated by the D-max-zonoid K4 = fuy >0ERĉ: lulla < i}, where 
1/p+1/q=1. 
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A RANDOM CROSS-POLYTOPE 


The obvious question When is a convex and compact set K a maz-zonoid? 
was answered by Molchanov (2008). The answer is given within the framework 
of stochastic geometry. 

Let Z = (Z,...,Za) > 0 € R? be a rv with E(Z;) € (0,00), 1<i<d. 
Then, 


Az = conv ({0, Z1e1,..., Zaea}) 


d d 
= [aze uzasa), (1.50) 
i=1 


i=1 


which is the convex hull of the vectors 0,Z ,e1,...,Zqeq € RÊ, is a random 
compact and convex set in [0,00)%. It is a random cross-polytope. 


THE SUPPORT FUNCTION OF A RANDOM CROSS-POLYTOPE 


The support function of a random cross-polytope Az is, for x = (£1,..., £4) > 
Oc R?, 


h (^z, x) =sup{(y,xz): y € Az} 


d d 
-sw { (Sten). doo, Dovel} 


i=1 i=1 
d d 
= sv { Aza > Aq,..-,Aq > 0,7 SX < r} 
i=1 t=1 
= max (z;Zi), 
1<i<d 


thus, 


As a consequence, we obtain for the support function of the symmetric exten- 
sion L(Az) 


E(h(L(Az),x)) = E(h(Az,|e|)) = ||ællz;  & €R’; (1.51) 


see equation (1.48). Note that the set L(Az) is, according to Lemma 1.12.7, 
convex as well. 

The preceding observation raises the idea that the random cross-polytopes 
Az play a major role when answering the question When is a convex and 
compact set a max-zonoid? posed earlier in this section. This is actually true; 
see Corollary 1.12.17, which characterizes max-zonoids. 
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THE EXPECTATION OF A RANDOM SET 


The following definition describes a rather flexible and useful concept of a 
random closed set, see Molchanov (2005, Section 1.1.1). 


Definition 1.12.11 (Random Closed Set) Let (2,.A, P) be a prob- 
ability space, i.e., 2 is a non-empty set, equipped with a o-field A and 
P is a probability measure on A. A map X : R — F := set of closed 


subsets of R is called a random closed set if, for every compact subset 
Kc R$, 


{wEQ: Xw)NK ADEA. 


Let X be a random closed set. We suppose that 
|X], = sup {llal|, : æ € X} 


has finite expectation, i.e., E (||X||,) < oo. A random closed set X with this 
property is called integrably bounded. 

At this point, we ignore the precise definition of a proper o-field on F 
such that ||X||, is a Borel-measurable rv. Instead, we refer to Molchanov 
(2005, Section 1.2.1). 

If X is a random closed set that is integrably bounded, then X is bounded 
with probability one, and thus, it is compact with probability one. In what 
follows, we assume that X is an integrably bounded closed and convex subset 
of [0, œ0)f; thus, it is in particular compact with probability one. 

The proper definition of the expectation E(X) of a random set X, given 
below, is crucial. 


Definition 1.12.12 (Selection of a Random Set) We call an rv 
E = (£1,..-,€4) € [0,00)4 a selection of X, if € € X as. The family of 


selections of X is denoted by S(X). 


We have, for any € € S(X), 


d 
él, =X & SX, 
al 
thus, 


E (|El) -DOs < E (IXI) < 


i.e., each component &; of € € S(X) has finite expectation E(€;) < oo. Recall 
that é; > 0. The selection expectation of X is now the set 


E(X) = {E@): € € S(X)}} c [0, £ (Xh) 
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Recall that A denotes the topological closure of a set A C RÌ, i.e., A is the 
intersection of all closed subsets of R? that contain A. By the expectation 
E(€) of an rv € = (&,...,€a), we denote the vector of the componentwise 
expectations: 


E(€) = (E(1),.--, £(€a))- 


For the sake of completeness we remark at this point that, actually, we 
do not have to take the closure in the definition of E(X), as {E(€) : € € 
S(X)} is already a closed set in our framework, called an Aumann integral; 
see Molchanov (2005, Theorem 1.1.24). 


Lemma 1.12.13 E(X) is a compact and convex subset of [0,00)¢. 


Proof. Since E(X) is a closed and bounded subset of [0,00)?, it is compact. 
It remains to show that it is convex as well. For each y®,y® € E(X), 


there exist sequences ED, @) E€ S(X), n EN, with lim, EF (e) = yO), 


limy +o E (r) = y). The convexity of X implies that ne +(1- ye? € 
S(X) for each à € [0, 1] as well; thus, 


dy + (1—d)y® = lim E (AEP +1- rE”) € E(X); 


recall that E(X) is by definition a closed set. 


Lemma 1.12.14 The symmetric extension L(E(X)) of E(X) satisfies 
L(E(X)) C E(L(X)). 
If E(X) satisfies the additional condition 
0< y< z for some z € E(X) = ye E(X), (1.52) 


then we also have E(L(X)) C L(E(X)) and, thus, the equality 


Condition (1.52) is satisfied, for example, for a random cross-polytope 
X = Az, i.e., we obtain 


L(E(Az)) = E(L(Az)); 


see also Example 1.12.16. 
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Proof. Choose y = (y1,---,;Ya) E€ L(E(X)), i.e., |y| € E(X). There exists 
a sequence €( € S(X), n € N, with lim,,. E(€™) = |y|. Multiplying 
each component € of ẹ™ = Ce eee a) with the sign of y;, 1 <i < d, 
provides a sequence 9), n € N, of an rv in S(L(X)) with limno E (n™) = 
y. Consequently, y € E(L(X)) and, thus, L(E(X)) c E(L(X)). 

Let y € E(L(X)). Then, there exists a sequence €(, n € N, of rvs in 
S(L(X)) with y = limno Æ (€™). This implies |y| = limno |E (€)|. 
As €™ € S(L(X)), we have |€(| € S(X) and, by the usual inequality for 
expectations 


0<|2(e0)|<2(e 


Condition (1.52) now implies that |E (E™)| € E(X) as well for each n € N. 
Since E(X) is a closed set, this entails |y| € E(X) and, thus, y € L(E(X)). 


) € E(X). 


Example 1.12.15 Let Az = conv({0, Z1e1,...,Zaea}) be a cross- 
polytope as defined in (1.50). Check that X = Az is a random closed 
set in the sense of Definition 1.12.11. It is obviously a compact and 
convex subset of [0,00)¢ with 


d 


d 
(xh, =] So i Ziei rodeo asi 
Ei 


w=1 1 


d d 
= sup Sess Rien Aa 0s AS r} 
i=1 


i=1 


thus, E (||X||,) < E(||Z||,) < œ. We have, moreover, for arbitrary 


numbers A1,..., Aa > 0 with Da AGS 1 


d 
E: XO A:Ziei € X = Az. 


w=1 
This implies 
E(€) = (A E(Z,),...,AaE(Za)) € E(Az). 


On the other hand, any € = ({,...,€a) E S(Az) satisfies & < Zi; 
thus, E(&) < E(Z:), l<i<d. 
Consequently, we obtain for the cross-polytope 


Ax z) C E (Az) c (0, E(Z)], (1.53) 
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where Agyz) = conv({0, F(Z, )e1,...,E(Za)ea}) and [0,E(Z)] = 
(0, E(Z1)] x + x [0, E(Za)]. Lemma 1.12.7, together with Lem- 
mas 1.12.13 and 1.12.14, implies that the symmetric extension 
L(E(Az)) = E(L(Az)) is a convex set. 


CHARACTERIZING THE SELECTION EXPECTATION 


The next characterization of the selection expectation is crucial. Its proof is 
taken from Molchanov (2005, Theorem 1.1.22). 


Theorem 1.12.16 (Selection Expectation Characterization) 
Let X C R? be an integrably bounded closed and convex random 
set. Its selection expectation is the unique conver and compact subset 
E(X) C R? satisfying 


E(h(X,£)) = h(E(X), £), x eR’. 


Proof. For each u € E(X), there exists a sequence of rv €n, n € N, in S(X) 
with limno E(€n) = u. Thus, for æ € R4, we obtain 


(u,2) = ( lim E(En),2) 
Jim E((En,2)) 


< E (sup {(y, £): y € X}) 
= E(h(X,2)). 


As u € E(X) was arbitrary, this implies 
h(E(X), x) = sup{ (u, æ): u € E(X)} < E(h(X, x)). 
Next, we establish the reverse inequality. Choose « € R? and put for € > 0 
X: := {y E€ X : (y, £) > h(X,x) —e}. 


Then, X+ is a non-empty random closed subset of X. Therefore, it has a 
selection s (Molchanov (2005, Theorem 1.2.13)) and, thus, 


(£e, T) 2 A(X, x) =E. 
Taking expectations yields 
(E(€-),@) = E((€-,x)) > E(h(X, x)) — £ 


thus, 
E(h(X,x))— e€ < h(E(X), x). 
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Letting £ | 0 implies E(h(X,x)) < h(E(X),x) for x € R? and, hence, equal- 
ity. 

According to Corollary 1.12.2, the compact and convex set E(X) is 
uniquely determined by its support function h(E£(X),-), and thus, E(X) is 
the unique compact and convex set with 


h(E(X), x) = E(h(X,2)), x eR’. 


This completes the proof. 


WHEN IS A GIVEN K A MAX-ZONOID? 


You may be asking yourself, When is a non-empty compact and convex set a 
maxz-zonoid? The answer is a consequence of the previous considerations. 


Corollary 1.12.17 (Characterization of Max-Zonoids) A com- 
pact and conver set K C [0,00)4, KM (0,00)4 + Ø, whose symmetric 
extension L(K) is conver as well, is a maz-zonoid iff K = E(Az), 


where Z = (Zi,...,Za) > 0 € R? is an rv with E(Z;) € (0,00), 
1<i<d. In this case, |||+||,- = lllz- 


Proof. Suppose that K = E(Az), where Z = (Z1,..., Za) > 0 € R? with 
E(Z;) € (0,00), 1 < i < d. Example 1.12.15, together with equation (1.51), 
show that E(Az) is a max-zonoid with ||-|| = ||-||_z- 

Suppose, on the other hand, that K is a max-zonoid with corresponding 
rv Z. Then, for x € R, we have 


Ile lle = RCL), æ) 
= llellz 
= E(h(L(Az), x)) by equation (1.51) 
=h(E(L(Az)), x) according to Theorem 1.12.16 
= h(L(E(Az)), x) according to Lemma 1.12.14. 


The set L(£(Az)) is convex, as shown in Example 1.12.15, and the set L(K) is 
convex according to the assumption that K is a max-zonoid. Theorem 1.12.16 
now implies that these sets coincide, because they provide identical support 
functions as shown above. But this yields E(Az) = K, completing the proof. 


Together with Example 1.12.10, Corollary 1.12.17 implies, for instance, 
that Apz) is a strict subset of E (Az) in general. 

Each D-norm ||-|| can be identified by Corollary 1.12.17 with the set 
E(Az), where Z is an arbitrary generator of ||-||p. For example, the logistic 
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norm ||-||,,, with p € [1,co], can be identified by Lemma 1.12.5 with Kg = 


fuy >0ER?: lull < ih where 1/p+1/q=1. 
Each max-zonoid K satisfies 


Anz) CKC [0, E(Z)| 


for some rv Z > 0 € R? with E(Z;) € (0,00), 1 < i < d; see equation (1.53). 
This is a characterization of a max-zonoid in dimension d = 2. 


Lemma 1.12.18 A conver and compact set K C [0, o0)? with 


A,c Kc (0,z| 


for some z > O € R? is, in the case d = 2, a maz-zonoid; it is a 
D-maz-zonoid iff z = (1,1). For d > 3, this conclusion is not true. 


Proof. Check that L(K) is a convex set if d = 2. The norm ||-|| p, generated by 
K, is monotone; see Lemma 1.12.3. Corollary 1.5.4 implies that ||-||,, = Illz 
for some rvs Z > 0 € R? with E(Z;) € (0,00), i = 1,2. In this case z = E(Z). 

Set d > 3 and put z := (1,...,1) € Rf, K := conv({0,e1,...,ea,y}) C 
R¢, where y has constant entry 3/4. Then, Az C K C [0,1]¢, but L(K) is not 


convex: 1 1 3 3 3 3 3 
Z = (Se el = Sete K. 
su +5 ( 4’ 4? 3) (0 4’ 3) g 


DUAL NORM OF A D-NORM 
Holder’s inequality states that 


d 
Sleivil < lll, lvl, ey € RË, (1.54) 
i=1 
where ||-||,,; Illl; are logistic norms with p,q € [1,00] such that 1/p+1/q = 1. 
Both are D-norms according to Proposition 1.2.1. In what follows, we show 
that this inequality can be extended to D-norms and their dual norms. 


Definition 1.12.19 Let ||-||,, be an arbitrary D-norm on R“. A radially 
symmetric norm ||-|| is called the dual norm of ||-|| p if the D-max-zonoid 
K = E (Az), which pertains to ||-||, is the unit ball with respect to 
Il], i.e., 


K={y>0€ER°: |yl| <1}. 


We denote the dual norm by ||-||(p)- 
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In equations (1.55) and (1.56), we show that a dual norm always exists. Its 
uniqueness is shown below. We do not require ||-||(p) to be a D-norm itself. 
This is actually true in dimension d = 2; see Proposition 1.12.26. 

Prominent examples are the logistic norms ||-||,, and ||- ||, with 1/p+1/q = 
1, which are dual to one another according to Lemma 1.12.5. This symmetric 
duality does not hold in the general case, i.e., if ||-||(p) is the dual norm of 
I|:I|p, then ||-|| 5 is generally not the dual norm of ||-||(p). 


Lemma 1.12.20 The dual norm ||-||(p) is uniquely determined. 


This is a consequence of the following lemma. 


Lemma 1.12.21 Let ||-||(,) and ||-||(2) be two radially symmetric norms 
on R? such that 


{y > OER: |lyllay < 1} = {y >0ER?*: lylo < i}. 


Then, |I-\lay = llo 


Proof. Choose y > 0 € R, y # 0, and put y* := y/ |lyl| (9). Then, ||y* ||) = 1 
and, thus, ||y*||(1) < 1. But this is ||y*||(q) < lly“ llo which implies |lylla) < 


\|y| (2). Interchanging both norms implies equality. 


HOLDER’S INEQUALITY FOR D-NORMS 


The next result implies, in particular, Hélder’s inequality (1.54). 


Theorem 1.12.22 (Hélder’s Inequality for D-Norms) Let |\-||(p) 
be the dual norm of ||-|| 5. Then, we have 


d 
dole! < lel lylo = ewe RY. 


i=l 


Clearly, x and y can be interchanged on the right-hand side of the pre- 
ceding inequality. 


Proof. We can assume wlog x,y > 0 € R, y £0. Put y* := y/ lullo We 
obtain 


d 
S ziyi = (æy) = llylio) (2, y*). 
i=1 


Clearly, ||y*||(p) = 1 and, thus, by assumption, y* € K = E (Az), where Z 
is a generator of ||:||p. This implies 
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(x, y") <sup{(xz,y): ye K} 


= h(K, <x) 

= h(E (Az), £) 

=E(h(Az,«)) according to Theorem 1.12.16 
= ||ællz by equation (1.51) 

= |jællp- 


Together, we obtain 


d 
X ziyi < llelly lylo» 


i=1 


which is the assertion. 


SPECIFYING THE DUAL D-NORM 


In what follows, we specify the dual norm of an arbitrary D-norm. Recall that 
each D-max-zonoid K € [0,00]? is a convex and compact set that satisfies, 
according to equation (1.53), 


ganag 


Its symmetric extension L(K) = {y € R° : |y| € K} is convex and compact 
as well. 
For « € RÊ, x £0, put 


1 


oe: ee 1.55 
lle ll cx) max{t>0:ta € L(K)} J 


and ||O||(¢) := 0. This defines a radially symmetric norm on R4, i.e., ||@l| (4) = 
IIe ||| (4); see Lemma 1.12.23 below; it is called a gauge in Rockafellar (1970, 
Chapter 15). In particular, L(K) is obviously the unit ball with respect to 
this norm, or, equivalently, 


K= fyz 0ER: lyla) < ik (1.56) 


Lemma 1.12.23 Let K C [0,00)4 be a D-maz-zonoid. Then, Illag is 
a radially symmetric norm on R? with 


lel < lælg) Sieh, æ € R°. 


1.12 D-Norms from a Stochastic Geometry Perspective 95 


Proof. It is obvious that ||-||(;¢) is radially symmetric and that it satisfies 
conditions (1.1) and (1.2). We have to establish the triangle inequality (1.3). 
Recall that the symmetric extension L(K) of K is convex according to the 
definition of a max-zonoid. 

Choose x, y € R?, both different from zero. We can suppose x + y Æ 0 as 
well. Put ti := 1/|2l|(4), t2 = 1/ llull) € (0,00). As L(K) is compact, we 
have tıx € L(K), tay € L(K), and, by the convexity of L(K), Atha + (1 — 
A)t2y € L(K) for à € [0,1]. With the particular choice 


we obtain 


1 
Atiz + (1 —Aj)tey = EN E€ L(K). 


tı tg 


But this implies 


1 
max {t >0:t(@+y) € L(K)} > ~—> 
iti 
hence, 
le + yl : 
4 he 8 
Mus) = max{t>0: tæ + y) € L(K)} 
zi 
= 1 1 
at 
aad 
tt t 
= [le lly + llul» 
ie., ||-|(q) defines a radially symmetric norm on R¢. 


Choose « > 0 € R4, x 4 0. Next, we establish the inequalities ||x||,, < 
æl K) < ||#||,- We have the following list of implications: 


kal 
ells fla 
z£ 
= —— E€ ^q sites 1 
Jel, © 8"? 
x 
= — EK by (1.53) 
æl 
1 
=> max{r>0: re € K} > —— 
I|al|, 


= æla < lelh 
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which is one of the two inequalities we want to establish. The fact that K C 
(0, 1]¢ by (1.53) implies 


ry:=max{r>0: rg E K} <rə := max {r > 0: ræ € [0, 1]°}. 


Note that rı = 1/||x||(,) and that r2 satisfies rə ||x||,, = 1. We obtain the 
following implications: 

nally =1= r llel 

> n llæll = r2 llel 


=> liella 2 lell» 


since rı < r2. This completes the proof. 


The following result is, therefore, an obvious consequence of Lemma 1.12.21, 
Theorem 1.12.22, and equation (1.56). 


Theorem 1.12.24 Let ||-||,, be an arbitrary D-norm on R? with cor- 
responding D-maz-zonoid K. Then, ||-||(p) = Ibl is the uniquely 
determined dual norm, and we obtain 


d 


X leil < liællp lulo = By € R°. 
i=1 


Example 1.12.25 From Proposition 1.4.1, we know that the convex 
combination ||-||p := A'|-||,, + €- A) |l-ll,, of two logistic norms ||-|[,,, , 
Ikllp, on R? with pı, p2 € [1,œ], A € (0,1), is again a D-norm. Let 
Z), ZC) be generators of ||-||,,; Il- llp, and let € € {1,2} be an rv with 
P(é = 1) =A = 1 — P(€ = 2), which is also independent of Z® and 
ZC). Then, ZS) is a generator of ||-|| p- 


The dual norms are ||-|,,, lll, with 1/p: + 1/a = 1, i = 1,2, 


and corresponding D-max-zonoids Kg, = fy >OER?: |lyll noes 1} = 


E (Aza), i = 1,2. Check that the D-max-zonoid, which corresponds 
to ||-||, is, according to Theorem 1.12.16, 


E (Age) = AKq, + (1 — A) Kg. 


The dual norm of ||-||p = Alil, + GQ — 4) is, consequently, 


Ill» 


1 
lelo = sapt > 0: tele Ky, +) Kad 
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Note that, with the particular choice pı = co, pə = 1, the convex 
combination ||x||,, = A||x||,, + (1 — A) ||z|, is the Marshall-Olkin D- 
norm. 


We close this section with a characterization of bivariate dual norms. 


Proposition 1.12.26 The dual norm of an arbitrary D-norm on R? is 


a D-norm. On the other hand, each D-norm on R? is a dual D-norm. 


The mapping ||-|| > lll) between the set of D-norms and the set of 
dual norms on R? is, consequently, one-to-one. 


Proof. From Corollary 1.5.4, we know that in dimension d = 2, the radially 
symmetric norm ||-||(;¢) is a D-norm iff it satisfies ||| < Illig < Illl- But 
this was established in Lemma 1.12.23 for general dimension d > 2. As a 
consequence, we obtain that any dual norm on R? is a D-norm. 

Choose, on the other hand, an arbitrary D-norm ||:||p on R? and put 
K := {x > 0 € R?: ||æ||p < 1}. The set K C [0, 00)? is compact and convex, 
and therefore, according to Lemma 1.12.18, we only have to show that 


A41) Cc K C [0, 1}. 


But this follows from the general inequalities ||| < ||-|| 5 < ||-||, in (1.4): we 
have, fora € K, 
læl < ll#llp <1 


thus, æ € [0,1]?. On the other hand, for arbitrary x = \,e + A2€2 € Aq); 
At, A2 > 0, A1 + A2 < 1, we have 


Aie: + Aze2||p < |]|Are1 + Azea||, < A1 + A2 < 1, 


thus, æ € K. This completes the proof. 


® 


Check for 
updates 
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D-Norms & Multivariate Extremes 


This chapter provides a smooth introduction to MEVT via D-norms. Stan- 
dard references to MEVT are Balkema and Resnick (1977); de Haan and 
Resnick (1977); Resnick (1987); Vatan (1985); Beirlant et al. (2004); de Haan 
and Ferreira (2006), and Falk et al. (2011), among others. For the sake of com- 
pleteness and for easier reference, we list some basics, starting with univariate 
extreme value theory. 


2.1 Univariate Extreme Value Theory 


Let X be an R-valued rv and suppose that we are only interested in large 
values of X, where we call a realization of X large if it exceeds a given high 
threshold t € R. In this case, we choose the data window A = (t, co) or, better 
adapted to our purposes, we put t € R on a linear scale and define 


An = (ant + bn, 00) 


for some norming constants a, > 0, bn € R. We are, therefore, only interested 
in values of X € Ay. 

Denote by F the df of X. The elementary conditional df of X, given that 
X exceeds the threshold ant + bn, satisfies 


P(X < an(t+ 8) +b, | X > ant + by) 
1— F(an(t + s) + bn) 


a 1— Flant tbn) ” 


s>0. 
We let the threshold ant + bn increase with n € N; thus, we are facing the 
problem: what is the limiting behavior of 


1— F(an(t + s) + bn) 


noo! 2.1 
1- Flatt) ea) 
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UNIVARIATE EXTREME VALUE DISTRIBUTIONS 


Let X1, X2,... be independent copies of X. Suppose that there exist constants 
an > 0, bn € R, such that, for x € R, 


p (Tsen bn <a) = P(X; < ant +a) 1 <i <n) 
an 
= F"(anz + bn) 
nso G(x) (2.2) 


for some non-degenerate limiting df G, i.e., there is no xo € R such that 
G(ao) = 1 and G(x) = 0 for x < xo. Then, we say that F belongs to the 
domain of attraction of G, denoted by F € D(G). 

If F € D(G), we deduce from the Taylor expansion log(1 +€) = ¢ + O(e?) 
for € > 0 that log(1 + €)/e¢ 4.40 1 thus, the equivalence 


F” (ant + bn) —n50 G(x) 
= nlog(1— (1 — F(anz + bn))) Ans log(G(x)) 
© n(1 = F(anz + bn )) >n=>o 7 log(G(x)), 


if 0 < G(x) < 1; note that 1 — F (anx + bn) n+00 0. Hence, we obtain 


1 — F(a,(t+s) + bn) log(G(t + s)) 


1— F(ant + bn) RR log(G(t)) ` (a 


if0< G(t) <1. 

According to the classical article by Gnedenko (1943) (see also de Haan 
(1975); Galambos (1987), or Resnick (1987)), we know that F € D(G) only if 
G € {Ga : a € R}, with 


exp(— (—x)®), «2 <0, 
Ga(x) = Oe for a > 0, 
I zx >0, 
0, x <0, 
Galz) = for a < 0, 
exp(—2“), x>O0, 
and 
Go(x) := exp(—e 7), xeER, (2.4) 


being the family of reverse Weibull, Fréchet, and Gumbel distributions. Note 
that G1 (x) = exp(x), x < 0, is the standard negative exponential df. 

The assumption F € D(G) is quite a mild one, practically satisfied by 
any textbook df F. We refer, for example, to Galambos (1987, Section 2.3) 
or Resnick (1987, Chapter 1), where the condition F € D(G) is characterized 
and the choice of the constants an > 0, bn € R, is specified. 
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For the df F(x) = x, x € [0,1] of the uniform distribution on (0,1), we 
obtain, for example, with an = 1/n, bn =1, n EN, 


lim F” (1+7) = lim (1+ 2)" = exp(2) = Gila), x <0. 
n—> oo n n— o0 n 
For counterexamples F ¢ D(G), we refer to Galambos (1987, Corollary 2.4.1 
and Example 2.6.1). 
The above different representations of Ga can be unified for 8 € R by 
putting 


F g(a) := exp (-( + B2)-¥*) ; 14+ Bax > 0, 
with the convention 
Fo(x) := io F(x) = exp (—e~*), xr ER. 


Note that the signs of 8 and a in these two representations are flipped about 
zero, i.e., Fg with 8 > 0 corresponds to Ga with a < 0, etc. With this 
particular parametrization, the set of univariate distributions {Fz : 6 € R} is 
commonly called the family of generalized extreme value distributions. 


MAX-STABILITY OF EXTREME VALUE DISTRIBUTIONS 


The characteristic property of the class of extreme value distributions (EVDs) 
{Ga : a € R} is their maz-stability, i.e., for each a € R and each n € N, 
there exist constants an > 0, bn € R, depending on a, such that 


G” (anx + bn) = G(x), xeR. (2.5) 


For Gi(x) = exp(x), x < 0, for example, we have an = 1/n, bn =0,n € N: 
air TN” 
t (=) = exp (=)" = exp(2) = G1 (2). 
n n 


Let 7,7... be independent copies of an rv 7 that follows the df Ga 
with arbitrary a € R. In terms of rvs, equation (2.5) means 


(4) _ 

p (Bsn h < e) = Pin sa), gee 
an 

This is the reason why Gy is called a maz-stable df, and the set {Ga : a € R} 

collects all univariate max-stable distributions, which are non-degenerate; see, 

for example, Galambos (1987, Theorem 2.4.1). 
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UNIVARIATE GENERALIZED PARETO DISTRIBUTIONS 


Suppose that F € D(Ga). Then, we obtain from equation (2.3) 


X — dy X — by, 
p(=b cea] X=b >) 


an an 


1 — F(an(t +s) + bn) 


= 1 — 
1 — F(ant + bn) 
log(Ga(t + s)) 
n co 1 =a ve PNA 
Mi log(Ga (0) 
H,(—(1+42)), ifa>0, 
=4 Ha (1+2), ifa<0, s20, (2.6) 
Ho(s), if a = 0, 


provided 0 < Ga(t) < 1. The family of df 


Ha(z) := 1 + log(Ga(z)) 
1—(-2)*, —1<x<0, ifa>0, 
=41-r*, x>1, ifa <0, (2.7) 
1—exp(-2z), «>0, ifa =0, 


parametrized by a € R, is the class of (univariate) generalized Pareto df 
(GPD) associated with the family of EVD. Note that H, with a < 0 is a 
Pareto distribution, Hı is the uniform distribution on (—1,0), and Hp is the 
standard exponential distribution. 

The following consequence is obvious: suppose that your data are realiza- 
tions from iid observations, whose common df is in the domain of attraction 
of an extreme value df. Modeling the distribution of exceedances above a high 
threshold by a GPD is, consequently, a straightforward option. As described 
by van Dantzig (1956), for example, floods that exceed some high thresh- 
old approximately follow an exponential df. This approach is known as the 
peaks-over-threshold (POT) method; see, for example, Beirlant et al. (2004, 
Section 5.3) or Reiss and Thomas (2007, Chapter 5). 


2.2 Multivariate Generalized Pareto Distributions 


In this section, we introduce multivariate GPD. They will be particularly 
useful for a smooth derivation of multivariate max-stable df in Section 2.3. 

Let Z = (Z1, ..., Za) be a generator of an arbitrary D-norm ||-||p on R? 
with the additional property 
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for some constant c > 1. According to Corollary 1.7.2, such a generator always 
exists. Let U be an rv that is uniformly distributed on (0,1) and that is 
independent of Z. 

Put 


V = (Vise Va) = (Zaye Za) = GZ. (2.9) 


Note that, for x > 1, 


p(Zsa)=P(2<u)=1-5, 
U £ oi 


i.e., 1/U follows a standard Pareto distribution (with parameter 1). 
According to Fubini’s theorem, we have, moreover, for x > c > 1 and 
l<i<d, 


T 
1 
1- ŻE (Z;) 
T 
ess (2.10) 
x 


where P (X,Y) = (P*X) x (P*Y) denotes the product measure when the 
rvs X and Y are independent. 

The product Z;/U, therefore, follows a standard Pareto distribution in its 
upper tail. The special case Z; = 1 yields the standard Pareto distribution ev- 
erywhere. We call the distribution of V = Z/U a d-variate simple generalized 
Pareto distribution (simple GPD). 
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THE DISTRIBUTION FUNCTION OF A SIMPLE GPD 


By repeating the arguments in the derivation of equation (2.10), we obtain 
for æ > (c,...,c) =e 


PW <e)=P(Benisisd) 


=p(Zcuisisa) 


t 


t 


= ie P (v > max, (=)) (P * Z)d(21,..-42a) 
=f 1- max (=) (Px Z)d(21,..., 2a) 


=|} p(u>S1<i<a) (P * Z)d(z1,..., 2a) 
[0,c]@ Ti 


(2.11) 


i.e., the (multivariate) df of V is in its upper tail, i.e., for x > c, given by 
1 — |1/2||p. 


THE SURVIVAL FUNCTION OF A SIMPLE GPD 


By repeating the arguments in the derivation of equation (2.11), we obtain 
for x > c 


t 


PW >a)=P(UsS1<isa) 
P 


=|} p(usS1<isa) (P x Z)d(z1,...,2a) 
[0,c]4 Ti 


7 


Zi 
= P|U< mi — Px Z)d(z,... 
Jay? (U5 sia S)) Pe Aakers 
= min (+) (P x Z)d(z1,..., Za) 
[o,c]¢ 1StSd \ Ti 
. (Zi 
= (i (6) =x} 1/z tp, (2.12) 
1<i<d \ Ti 
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i.e., the survival function of V is equal to the dual D-norm function 21/2 Up, 
for x > c. 
As an obvious consequence of (2.12), we obtain the equation 
P(V > tæ) V1/(tx)Up 1 


PV ctl Vagai = eee Sl 


, , (213 
P(V > æ) VI/erp t ZL (2.13) 


independent of x > c, provided 21/x Vp > 0. Note that 1/t, t > 1, is the 
survival function of the univariate simple Pareto distribution. This is a first 
example demonstrating excursion stability or POT stability of a (multivariate 
simple) GPD. Actually, excursion stability is a characteristic property of a 
GPD in the univariate case as well as in the multivariate case; see, for example, 
Falk et al. (2011, Section 5.3) and Proposition 3.1.2. 


APPLICATION TO RISK ASSESSMENT 


Suppose that the joint random losses of a portfolio consisting of d assets are 
modeled by the rv V. 

According to equation (2.12), the probability that the d losses jointly ex- 
ceed the vector x > c is given by 


Zi 
PW > 0) = (an (=)) =U1/x Np. 
1<i<d \ a; 


Next, we apply different models for the underlying D-norm ||-||. If we 


choose ||-||5 = ||-||,,, then we know from (1.13) that X £ lo = mini<i<a |zil; 
thus, 
. 1 1 
P(V > x)= min {(—] = x > (1,...,1) 
1<i<d \ Ti max zi 
1<i<d 
If we choose ||-|| 5 = ||-||,, then we know from (1.12) that 2-21 = 0; thus, 


P(V >a) =0, x > (d,...,d). 


This example shows that assessing the risk of a portfolio is highly sensitive 
to the choice of the stochastic model. For æ = (d,...,d) and ||-||5 = Ikla 
the probability of the event that the losses jointly exceed the value d is 1/d, 
whereas for ||-||p = ||-||,, it is zero. In dimension d = 2, this means a joint loss 
probability of 1/2 versus a joint loss probability of 0. 


STANDARD MULTIVARIATE GPDS 


Let Z = (Zı,..., Za) again be a generator of the arbitrary D-norm ||-||, 
on Rf, with the additional property Zi < c, 1 < i < d, for some constant 
c > 1, and let the rv U be uniformly distributed on (0,1) and independent of 
Z. The distribution of Z/U is a simple GPD. But now we want to consider 
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U/Z; however, in this case, we may divide by zero. To overcome this problem, 
choose a number K < 0 and put 


= (oar(—2,x),..0(—,)). ero 


The additional constant K avoids division by zero. Repeating the arguments 
in equation (2.11), we obtain 


P(W <2) =1-—|e\,; zo <2 <0€ER%, 


where ao < 0 € R? depends on K and c. 
We call a df H on R? a standard GPD if there exists £o < 0 € R? such 
that 
H(z) =1-|l2||p, go <a2<O0ER?. (2.15) 
From (1.4), we obtain 0 < H(ao) = 1 — ||xo|| 5 < 1— ||xol],, and, thus, 
maxi<i<a|£oi| < 1, or £o; > —1, 1 < i < d. As a consequence, the i-th 
marginal df H; of H is given by 


H(z) = 1- ||zeil|p =1— |z] lleill =1 +2, 


for zo; < x < 0, 1 < i < d, which coincides on [xo;, 0] with the uniform df on 
(—1,0). 

A repetition of the arguments used in the derivation of equation (2.12) 
provides the survival function of a standard multivariate GPD: 


PW>a2)=Raetp, w<2<0ER’. (2.16) 


GENERAL MULTIVARIATE GPDS 


Let W = (W,,...,Wa) follow a standard multivariate GPD with corre- 
sponding D-norm ||-||,. Choose parameters aj,...,aq E R. Put a(x) := 
log(Ga(x)), 0 < Ga(x) < 1, where Gq is a univariate max-stable df from 
(2.4). Note that pa is a strictly monotone and continuous function, whose 
range is (—oo,0). Then, by definition, 


Y = (io. Ya) := Oe Wii ase (We) (2.17) 


follows a general multivariate GPD. 
For x < 0, we have 
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Note that yļ (x) = x on (—oo,0), and thus, (vy (Wi), v7 (Wa)) = 
(W,,...,Wa) follows a standard GPD. With the choice aj = --- = aa = —1, 
we obtain a simple GPD 


(YII), .--, Pat (Wa)) 


= (imme (8) “ns (r (Ee) ") 


1 


=—(4Z),...,Za)=V 

TA l; ’ d) 

if U/Z; < -K or Z;/U > -1/K for 1 <i < d. 
With aj = --- = Qaa = 0 and Z;/U > —1/K for 1 < i < d, we obtain 
(bo (Wi), -Yo (Wa)) = (—log(U/Z1),...,— log(U/Za)) 


= (log(Z1) — log(U),...,log(Za) — log(U)) , 


where — log(U ) follows the standard exponential distribution on (0, o0). 

As mentioned earlier, the characteristic property of a GPD in the uni- 
variate case as well as in the multivariate case is its excursion stability or 
POT stability; see, for example, Falk et al. (2011, Section 5.3). The df of Y is 
given by 


P(Y < æ) = P((Wis -< Wa) < (Was (21), -+s aa(ta))) 
E E T E ls 


if (Ya, (£1), -< -, Paa(za)) > £0; for such g, its survival function follows from 
equation (2.16): 


P(Y > x) = P((Wi,...,Wa) > (bos (£1), - <, Yaa l(za))) 
=% (Par (x1), tee Wes (xa)) Up. 


An alternative but equivalent definition of a general multivariate GPD, 
in terms of its copula, together with its univariate margins, is given in Re- 
mark 3.1.3. 


2.3 Multivariate Max-Stable Distributions 


In complete accordance with the univariate case, we call a non-degenerate df 
G on R? maz-stable if, for every n € N, there exist vectors an > 0, bn € RI, 
such that 
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G” (a,x + bn) = G(x), ze R4. (2.18) 


Recall that all operations on vectors, such as addition, multiplication, etc., 
are meant componentwise. The preceding equation can again be formulated 
in terms of componentwise maxima of independent copies 7), 7?),... of an 
rv 7 =(m,---,Na) that realizes in R4, and that follows the df G: 


P (“se n® = bn 
an 


<2) =P(n<a), z eR’. 


Note that both the maximum function and division are taken componentwise. 

Different from the univariate case in (2.4), the class of multivariate max- 
stable distributions or multivariate extreme value distributions, also abbrevi- 
ated as EVD, is no longer a parametric one, indexed by some a € R. This is 
obviously necessary for the univariate margins of G. Instead, a non-parametric 
part occurs, which can be best described in terms of D-norms, as is shown in 
what follows. 


SIMPLE MULTIVARIATE MAX-STABLE DISTRIBUTIONS 


Definition 2.3.1 An EVD G on R’ is called simple max-stable if each 
univariate marginal df G; of G is the Fréchet df with parameter one, or 
unit Fréchet df for short, i.e., 


Next, we show that such simple EVDs actually exist. Choose an arbitrary 
D-norm ||-||p on R2. Let VO = (V,...,V.), VO = (V,...,V.),... 
be independent copies of the rv V = Z/U, where Z = (Z,...,Za) is a 
generator of ||:||p with the additional property that it is bounded by some 
c>1€R’%, and the rv U is uniformly distributed on (0,1). The rv Z and U 
are assumed to be independent as well; thus, the rv V follows a simple GPD. 
For the vector of the componentwise maxima, 


max V) := ( max Vo, max ve, ..., max vi?) ; 
l<i<n l<i<n l<i<n < 
we obtain from equation (2.11), for x > O and n large enough such that 
nx > C, 
l (i) (a) 
P(|— maxV\" <a =P(Vv < nx, 1<i<n) 


n 1<i<n 
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= P( ete 


as 
P 


noo EXP (- 


) =: G(a), (2.19) 


where 1/æ is meant componentwise. 
Suppose that at least one component of a is equal to zero, say component 
tio. Then, 


by the fact that E(Z;,) = 1. As a consequence, we obtain in this case 


1 

P G max V® < 2) = P (V < næ)” < P (Zio = 0)” >n 0. 
Ni<i<n 

Hence, we have 


1 
P G max VO < 2) nc G(x), x eR’, 


N1i<i<n 


where 


G(x) = exp(—|zllp), if 2 >o, (2.20) 
0 elsewhere. 


Since P (n~! maxi<i<n V® < -), n EN, is a sequence of df on Rf, one easily 
checks that its limit G(-) is a df itself; see, for example, Reiss (1989, (2.2.19)). 
It is obvious that the df G satisfies 


) -eo(-|; 
= exp | — ||— 
D x 


G"(nz)=G(z), «ER neN, 


which is the max-stability of G. Let the rv € € R? have df G. By keeping 
x; > 0 fixed and letting zx; tend to infinity for j A i, we obtain the marginal 
distribution of G: 


NL 


G” (nx) = exp (- 


) = 60), z>0ER?, 
D 


and, thus, 
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Gi(xi) =P (i < zi) 
= lim, P(Gi < zi, £j < 3,5 #4) 
Yi 


L Jeil 
= €x = || €i 
p m D 
o) 
=exp | -— 
Ti 


by the fact that each D-norm is standardized. Each univariate marginal df G; 
of G is, consequently, the unit Fréchet df 


Gi(x) = exp (+) ; x>0. 


This proves that simple EVDs actually exist. We see later on in Theorem 2.3.4 
that, actually, each simple EVD can be represented by means of a D-norm as 
in (2.20). 


STANDARD MULTIVARIATE MAX-STABLE DISTRIBUTIONS 


The simple max-stable df is the standard approach in the literature on mul- 
tivariate extreme value analysis, but the standard max-stable df turns out to 
be simpler. 


Definition 2.3.2 A df G on R? is a multivariate standard max-stable 
(SMS) df, or standard EVD, iff it is max-stable in the sense of equation 
(2.18), and if it has standard negative exponential margins: 


G;(x) = exp(z), TLO VES: 


In what follows, we show that an SMS df G exists as well. Let the rv 
£ € R? follow a multivariate simple max-stable df as in (2.20), i.e., P (€ < £) = 
exp (— ||1/a||,), £ > 0 € R’. Put 


=-=- =) 
TSE Na 
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and note that P(€; < 0) = 0, 1 < i < d. Then, for æ < 0 € R, we obtain 


1 
Pin <a) = Pà- <a, tsisa) 
1 
-P(&<-Ż1<i<a) 
Ti 
ne 
x 
= exp(— ||z||_p) =: Gp (z). 
By putting 
Cote) <p (Nein. meRO lg) Gan 
for æ = (x1,..., £4) € RI, we obtain a df on R?, which is max-stable as well: 


Ch (=) = Go (2), 2 e Rt néeN. 


G% (-/n) is the df of the product n maxi <i<n nN, where n®, n®,...,n™ 
are independent copies of n. 

Note that each univariate margin of Gp is the standard negative exponen- 
tial df: 


P(n; < £) =P(n < zei) 
=exp (— ||ze;|| 5) 
= exp (— |z] |le:llp) 
=exp(z), x <0. 


This shows that SMS dfs actually exist. 


CHARACTERIZATION OF SMS DF 


We are going to show that any SMS df or standard EVD can be represented as 
in (2.21), i.e., the theory of D-norms allows a mathematically elegant charac- 
terization of the family of SMS dfs, presented in the next result. It comes from 
results found in Balkema and Resnick (1977); de Haan and Resnick (1977); 
Pickands (1981), and Vatan (1985). 


Theorem 2.3.3 A df G on R? is an SMS df iff there exists a D-norm 
Il:l| 5 on R? such that 


G(a) = exp (— |lællp), z<0ecR?. 
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Let the rv 7 = (m,.--,7a) follow an SMS df G(x) = exp (- ||æ]| p), 2 < 
0 € R’. Then, the components 7, ..., Na are independent iff the D-norm ||-|| p 
is the logistic norm ||-||,, whereas the components are completely dependent, 
(i.e, m1 =+ = na as.), iff ||-||5 = Illl- This is the reason why we call ||-||, 
the independence D-norm and ||-||,, the complete dependence D-norm. 


Proof. The implication “=” in the previous result is a straightforward con- 
sequence of the fact that the function G(x) := exp(—||x||,), x < 0 € R¢, 
is the limit of a sequence of dfs. Repeating the arguments in equation (2.19), 
one obtains, for æ < 0 € R, 


, i ; lel \” 
lim P (omw < 2) = lim (1 = — i = expl- |lællp), (2-22) 
where W®, W(),... are independent copies of the rv W, which follows a 
standard GPD as in (2.14). Note that for any D-norm on Rf, there exists a 
bounded generator according to Theorem 1.7.1. The construction of the rv W 
requires this condition. It is, therefore, readily seen that G(x) = exp (— ||x||p), 
x < 0 € R, defines a multivariate df; see, for example, Reiss (1989, (2.2.19)). 

Next, we prove the implication “=.” Let G be an arbitrary max-stable df 
with standard negative exponential margins, i.e., there exists a max-stable rv 
n= (Mm, ---, Na), whose df is Œ. Then, with c € (0,1) and a := 1/c, 


(c) aa (c) c) — t = 
OE ake E (Ga 


follows a max-stable df, say Ge, with Fréchet margins 


1 
P(e) aa) = exp (-=) =F,(z), «>0,1<i<d. 
T 


Q 


Its expectation is (1 — c) =: pe, which can be seen by applying el- 
ementary rules of integration as follows; note that the density of Fy is 
exp(—2~°)x~?—!a, x > 0. We have 


1 co 
E (—) =} xexp(—a*)2~* ladx 
[mi] 0 


ll 
Q 
Sm 
8 
8 
R 
o 
tal 
T 
| 
Si 
= 
a 
8 


=I(1—c). (2.23) 
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by proper substitution. The rv 


c c 1 
BD gag) = 7g 


now satisfies Zo) >Oand E ZL) =1,1<i<d,ie., Z© is the generator 
of a D-norm, say ||-||p,- 
The fact that G is max-stable means that, for any n € N and « € R, 


qn (=) =G(æ) or G(x) =G(nz). 


n 


This implies that, for any n,m € N and a € R¢, 


qn (Zx) = G(ma) = G” (x) 


n 


grim (Zz) = G(a). 


Note that G is continuous because each univariate margin is a continuous df; 
see, for example, Reiss (1989, Lemma 2.2.6). Letting n and m tend to infinity 
such that n/m — t > 0, we obtain by the continuity of G that max-stability 
of G is equivalent to 


Gt (=) =G(x) or G(ta) = G(x) 
for any t > 0, x € R. The final equation clearly implies that Ge satisfies 
G.(ta)=G.(a)'/", t>0,2€R*. 


In the following proof, we use the fact that G.(a) > 0 for x > 0. Otherwise, 
the max-stability of Ge would imply 


0=G.(x) = Gel)! = G.(ta) 


for some « > 0 € R? and each t > 0; letting t converge to infinity would 
obviously produce a contradiction. Using Lemma 1.2.2, we obtain, for æ > 
Oc R?, 


- 2°) 
ley, = £ ( max, (2:2) 
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= 2 u 1 — exp (Ere (c. (=))) dt 
He Jo t a 
(aE E l)e 
ija 
-CfE 


by the substitution t (— log (Ge (1/a)))'/* t; note that fo. 1—exp (—1/t%) dt 
= ue. Observe that, for x > 0 € R?, 


thus, we have for x € R? 
a l/a 
ltl, = (~ log (G (= |æ)", (2.24) 
where |x| is also meant componentwise. This yields 
lim ||æ||p, = —log(G(— |aI)). 


As the pointwise limit of a sequence of D-norms is a D-norm by Corol- 
lary 1.8.5, we have proved that 


lællp := —log(G(-|z|)), 2 RY, 
defines a D-norm on R4, or 


G(-|z|) = exp (- llælp),  & ER®, 


which completes the proof. 


CHARACTERIZATION OF AN ARBITRARY MAX-STABLE 
DISTRIBUTION 

Any multivariate max-stable distribution can be represented by a D-norm 
together with transformations of the univariate margins. 


Theorem 2.3.4 Any multivariate maz-stable df Giai, aa) with uni- 
variate margins Go,,.--,Ga, can be represented as 


aa) (T) = G (Way (21), «++ Pag (La) 
= exp (- Nena Vaa Ealo) ’ x eR’, 


where G(x) = exp(—||z||p), £ < 0 € RI, is a standard EVD and 


Yo, (x) = log (Ga; (£)),  0< Ga(#) <1, 1<i<d, 
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—(-2)%, C20) if ai > 0, 
— r% , T > 0, af Qi < 0, 
—exp(—z), «ER, if oy = 0. 


Note that pa, is a strictly monotone increasing and continuous function, 
whose range is (—0o, 0). The reverse implication in the preceding result is true 
as well. Possible location and scale shifts in each component can clearly be 
incorporated. 

Theorem 2.3.4 is an obvious consequence of Theorem 2.3.3 and the next 
lemma. 


Lemma 2.3.5 Suppose that the rv X = (Xj,...,Xa) has df 


Ni = Wa, (Xi), 1 < 1 < d. 


Then n = (m, ..-, na) follows the SMS df Ga,...1)- 
On the other hand, suppose that the ru n = (m,..-,"a) follows the 
SMS df Ga,...1)- Then 


= (X1,...,Xa) = (Way (Mm) Vaa (na)) 


has df Gia, 


Proof. We establish the first part; the second part is obvious. Since Go, (X;) 
is uniformly distributed on (0,1), it is clear that n; = log(Ga,(Xi)) has df 
exp(x) = Gi(a), x < 0. It remains to show that the df of the rv 7, say H, is 
max-stable. But this follows from the fact that G(o1,...,œ4) is max-stable with 
G® (vg; (2/n)) = Ga, (vq; (x)): For xi < 0, 1 <i < d, we have 


H(x1/n,...,ta/n)” 

= P(ņ < xi/n,1 <i < d)” 
P(X; < Yz} a 
P(X, < 43 (zi), 1<i<d) 

= H(21,..., £a). 


Lemma 2.3.5 can also be formulated as 


Gloi,...aa) (yai (21), Hes gis, (za)) = Grice Fis reza za), 


for x; < 0, i < d. The max-stability of G(a,,....a,) is therefore preserved by 
transforming each margin onto the standard negative exponential distribution. 
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Example 2.3.6 With the bivariate D-norm ||(x,y)||p, , as defined in 
Lemma 1.10.6, we obtain from Theorem 2.3.3 that 


G(x, y) := exp (- ieai x,y <0, 


defines a bivariate SMS df. Let the rv 7 = (m, 12) follow this df. Then, 
by Lemma 2.3.5, the rv 


X = (X1, X2) := (Y3 (m), ba (mp) 


follows a bivariate max-stable df with Gumbel margins. Precisely, we 
have 


P(X, < z, X2 < y) = P(m < Yo(x), n2 < Yoly)) 


Ber (- oz), Yow))IIb...) 


= exp (= [I(exp(-2), exp(-))Ilp,,.) 


= exp (- exp(—2) (= 


= Goo) (2; Y), LYE R, 


which is the bivariate Huisler—Reiss distribution, with parameter À = 
Vt — s/2; see, for example, Falk et al. (2011, Example 4.1.4). 


MIN-STABLE DISTRIBUTIONS 


Let XX)... be independent copies of the rv X on R?. The rv X (or its 
distribution) is called min-stable if there are constants an > 0 € Rf, bn € RI, 
n € N, such that, for each n € N, 

P (ea > 2) =P(X>2), «eR. (2.25) 


an 


Multiplying both sides by —1, equation (2.25) becomes 


an 


ey XO ph 
é 2 < -æ ) = P(-X < —2), eR’, (2.26) 
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i.e., the rv X is min-stable iff —X is max-stable. As a consequence of Theo- 
rem 2.3.4, we obtain the representation 


P(X > x)= P(-X < -2) 
= exp (— ||(#ar(—21),---,Paa(—2a))llp) (2.27) 


with some D-norm ||-||,, on R? and ay,...,ag € R. If, in particular, ay = 
+++ = aq = 1, then we obtain 


P(X > x) = exp(—|lz\|_p), z>0cR’, 


which has standard exponential margins P(X; > x) = exp(—2), x > 0. Rep- 
resentation (2.27) provides the complete family of min-stable distributions, in 
arbitrary dimension d € N. 


TAKAHASHI REVISITED 


We can now present the original version of Takahashi’s characterizations 
in terms of multivariate max-stable dfs. In what follows, let the rv 7 = 
(m1,---;Na) have the SMS df 


P(n < x) = G(x) =exp(—llel[p), æ<0eER, 


with an arbitrary D-norm ||:||p on R¢. Theorem 1.3.1 can now be formulated 
as follows. 


Theorem 2.3.7 With 7 as above, we have the equivalences 


(i) m,.--,Na are independent 


4> Jy<0eR?: 
d 


Pm < yi, 1 <i < d) =|] Pin < yi). 


I=L 


(i) m =N =: =Na a.s. 


=> P(m Sips =o. = — P =l) 


Proof. If m,...,ņa are independent, its df can be written explicitly: 


d 


Pr < 21,...;, 48 < za) = | | P(n < 2) 
{=I 
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ose) 


=exp(—||e|,), © <O0ER®, 


i.e., the D-norm corresponding to n = (m,..., na) is ||-||,. lfm = 2 =- = 
Na a.s., then 
P(m <%1,..-,Na < ta) = P(m < min(a1,...,2a)) 
= exp(min(21,...,2a)) 


= exp (— ||zl|,,.), £z<0ER?. 


The assertion now follows from Theorem 1.3.1. 


The next characterization is an immediate consequence of Theorem 1.3.4. 
Note that, for arbitrary 1< i < j < d, 


= exp (— lle: + e;llp) - 


Part (ii) in the next result is obviously trivial. We list it for the sake of 
completeness. 


Theorem 2.3.8 With 7 as above, we have the equivalences 


(i) m,..., na are independent iff m,...,Na are pairwise independent. 


(i) m= =- = na a.s. iff q, ..., Na are pairwise completely depen- 
dent. 


According to Theorem 2.3.4, the distribution of an arbitrary d-variate 
max-stable rv can be represented by means of an SMS rv 77 together with 
a proper non-random transformation of each component mi, 1 < i < d. The 
preceding characterizations, therefore, carry over to an arbitrary multivariate 
max-stable rv. 

Note that pairwise independence of rvs in general does not imply complete 
independence. Take, for example, an rv U, that realizes in the set {1, 2,3, 4} 
and attains each element with equal probability 1/4. Put X, := 1,1,2;(U), 
Xə := 141,33(U), X3 := 1y2,33(U). Then, X1, X2, X3 are pairwise independent, 
but they are not completely independent: 


P(X, =1, X2 = 1, X; = 1) = 0 4 P(X, =1)P(X2 =1)P(X3 = 1) = 


1 
a 
PICKANDS DEPENDENCE FUNCTION 


Take an arbitrary D-norm on R?. Obviously, for æ 4 0 € R®, we can write 
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xz 
SF Al) ha (=) , 
iehl» jel 


where A(-) is a function on the unit sphere S = {y € R°: |lyllı = 1}. It is 
ea that it suffices to define the function A(-) on S4 := {u > 0 € Ri! :; 


Da a uj < 1} by putting 


d—1 
u) = U1,---;Ud—1, L — y Ui 
i=1 


The function A(-) is known as Pickands dependence function, and, according 
to Theorem 2.3.3, we can represent any SMS df G as 


G(x) = exp (— ||a||p) 


EOE (o) 


An arbitrary max-stable df can be represented correspondingly. 
In particular, in the case d = 2, we obtain, for u € [0,1], 


lælp = Wells 


D 


A(u) = ||(u,1 — u)||p = E (max(uZ1, (1 — u)Z2)) , 


with A(0) = A(1) = 1, max(u,1 — u) < A(u) < u + (1 — u) = 1. According 
to the normed generators theorem there exists a generator (Z1, Z2) of ||-||p 
with Z2 = 2 — Z1; see Corollary 1.7.2. This entails a more refined analysis of 
the function A(-) in the bivariate case. For a further investigation we refer to 
Falk et al. (2011, Chapter 6). 


THE EXTREMAL COEFFICIENT 


To measure the dependence among the univariate margins by just one number, 
Smith (1990) introduced the extremal coefficient as that constant € > 0, which 
satisfies 

G*(a,...,") = H (x), x ER, 


where G* is an arbitrary d-dimensional max-stable df with identical univariate 
margins GÏ = --- = G} =: H. If we have independence of the margins, then 
€ = d, and if € = 1, we have complete dependence. 

Two questions naturally occur: Can we characterize this €? Does it exist 
at all? 

According to Lemma 2.3.5, we can transform wlog the margins of G* to 
the standard negative exponential distribution exp(x), x < 0, thus obtaining 
an SMS df G. For this, we have according to Theorem 2.3.3 the representation 
G(x) = exp(- ||æl||p), x < 0 € R?, with some D-norm ||-||p on Rt. As an 
immediate consequence, we obtain 
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G(z,..., £) = exp(- |, 10258) 5) 
= exp(z |1|| p) 
= exp(zx)!H4llo, x <0, 
yielding 
e= |ti € [d], (2.28) 
according to the general inequalities ||| < ||-l|p < ||-l|, in (1.4). The ex- 
tremal coefficient is, therefore, the D-norm of the vector 1. 
For the family of logistic D-norms ||z||,, = os Iz:P) a p € [1,00], we 
obtain, for example, 


d, if p= 1, 
Itl, = § d”, ifp eE (1,00), 
1, if p = œ. 


From Takahashi’s characterization in Corollary 1.3.2, we already know that 
for an arbitrary D-norm ||-|| p 


ya flee. B 
l-l» he = it» n 


Suppose a df F is in the domain of attraction of an arbitrary multivariate 
EVD G* as defined in (3.1). The df G* has corresponding D-norm ||- ||p by 
Theorem 2.3.4; thus, € = ||L|| p is a measure of the tail dependence of F. 

A refined tail dependence coefficient, which measures the dependence be- 
tween tail independent margins, is defined in equation (5.21). 


2.4 How to Generate Max-Stable RVS 


In this section, we show how a max-stable rv can be generated by means of 
independent copies of a generator of the corresponding D-norm. A well-known 
representation of order statistics from the uniform distribution on (0,1) will 
be a crucial tool. As an application, we obtain a sharp lower bound for the 
survival function of an SMS rv. 


A CRUCIAL REPRESENTATION OF ORDER STATISTICS 


In what follows, we denote by Z™®, Z®),... independent copies of a gen- 
erator Z of a D-norm ||-||p on Rt. Let U1, U2,... be independent and on 
(0,1) uniformly distributed rvs, which are also independent of the sequence 
Z® eA) en 
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Denote by 
Ui:n < Van < FAN < Unin 
the ordered values of U1,..., Un, n € N, or order statistics, for short. It is well 
known that , 5 
Dai Ek 
(Uin) =D j= 5 (2.29) 
wai k / jy 
where Fy, E2,... are iid standard exponential rvs; see, for example, Reiss 


(1989, Corollary 1.6.9). In what follows, we suppose that the sequence 
E,, E2,... is independent of the sequence Z“), Z@),... as well. 


GENERATION OF STANDARD MAX-STABLE RVS 


The next result shows how to generate an rv with df G(a) = exp(— ||z||p), 
x <0 € R?, by means of the two sequences E1, E2,... and Z,Z@),... 


Proposition 2.4.1 Put V; := 1/ yy Ex, i E N. Then, the rv 


1 
n := 


ae, (ViZ@) 


has the df G(x) = exp (- ||æ||p), £ < 0 € R°. 


Clearly, a simple max-stable rv € with df P(€ < æ) = exp (-— ||1/æ]| p), 
x > 0 € R$, is generated by putting 


g= -7 = sp (V20). 


Proof. Adopting the arguments in (2.11) and (2.19), we obtain, for æ > 0 € 
R4, even without the assumption that Z is bounded, 


1 1 ; 1 
P | — max — 7%) <r 5 xp | — ||— 
(G PET (z ) ) 7 ( |; 


where € is a simple max-stable rv. 
Clearly, for each n € N, we have the equality 


1 l og 1 1 ; 
P (+ max (52) < 2) =P G max ( z) < Ej, 
n 1<i<n \ Ui n 1<i<n \ Vin 


owing to the independence of the sequences U;,U2,... and Z®, Z®,.... 
From representation (2.29), we obtain 


) = P(E <æ), (2.30) 


D 
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n+1 
E 1 ; 
=P &k=1 k max = Z) <z]. 
n 1<i<n an Ek 


The law of large numbers implies ah Ex /N >n 1 a.s. Moreover, 


1 ; 1 ; ; 
max = Z” noo SUP = = sup (viz) = £, 
StS \ Sy Ek ieN \ 0-1 Ek ieN 


where we know from equation (2.30) that € has the df 


1 


T 


P(E < x) = exp (- 


). z>0ER?. 
D 


Putting 


completes the proof. 


SURVIVAL PROBABILITY OF STANDARD MAX-STABLE RVS 


The representation of an SMS rv in Proposition 2.4.1 enables the derivation 
of a lower bound for its survival probability. 


Lemma 2.4.2 Letn be an SMS rv on R with corresponding D-norm 
II'Ilp- We have, for x < 0 € R, 


(i) P(n > x) > 1 — exp (- X £ ùp), 


(n > sx) 


B 
(it) H adr ada a Rx Up. 
S 


Proof. We suppose the representation 
_ 1 
 suDjen (V;Z) 


from Proposition 2.4.1. Using the notation in its proof, we obtain 


P(n>2)=P (sup (viz) i a) _1_p (sup (v.2) z a) 


icN |æ] icN |æ] 
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with 
P (sup (viz) Z a) 
ieN |x| 


-r(Ua tear} 


j=1ieN 


(here comes the crucial inequality) 


~- NU {« Mz" < rea} 


iE€N j=1 


(the continuity of probability theorem implies) 


n d 
= jim, P NU < of 
i=1j=1 


n d i 
Ca USE >k z} 
i=1 j=1 \k=1 


n d i (i) 
-1 Ek |x5| 2; 
. k=1 
= piua 4 N U n+l E 2 rt 
i=1 j=1 k=1 k n 03: Tan) 


ll 
B 
mv) 
ie 
m 
S 
3 
V 
| 
fs 
AB 
AB 
a 
o> 
8 
= 
NS 
L 
Sal 


n00 


ll 
Fi 
be 

me) 

Zo 
SDE 

S 

IV 

|= 

cE, 

AS 
= 

È 
N 
< 
“sa 
ee 
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(by the independence of the sequences U;,U2,... and Z“), Z),...) 


1 n 
= lim p(u> ee (|2j| Z; )) 


n— oo 


(where U is on (0,1) uniformly distributed and independent of Z) 
siw G= (a a A a 
— nao n 122a HAS 
= exp (—# ( nin (esl 25)) ) 


=exp(-V zp), 


which is part (i). 
Part (ii) follows from the inclusion—exclusion principle (see Corol- 
lary 1.6.2), together with (1.10): 


d 
P(n > sxz)=1-— P (Ù {ni < sz) 


i=l 
=1- So (DPI P(n < sai, i € T) 
OAT C{1,...,d} 
= So DFIG — P(n < szi, i€ T)). 
OAT C{I,...,d} 


But 


1— P(n; < szi, i € T) = 1 — exp (-x (zts [zi] Zi ))) 


= 1-0 (-s6 (gs 29)) 
= 3E (uss z:)) + o(s) 


according to the Taylor expansion exp(<) = 1 +€ + o(£) as € — 0; thus, 


Pin>sa)=s DO (DTE (maxal Z:)) +009) 


OAT C{Il,...,d} 


=sk 5 (—1)71* max(|2| Zi) + o(s) 
OATC{I,...,d} = 


=sE (mi min (\2il Z; ») + o(s) 


strata 


according to Lemma 1.6.1. This completes the proof of Lemma 2.4.2. 
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2.5 Covariances, Range, etc. of Standard Max-Stable rvs 


Let n = (m,---,7a) be an SMS rv. In this section, we compute the covariance 
Cov(m,7;) between its components, their Lj-distance E(|n; — 7;|) as well as 
the expected range E (maxi<i<am — mini<i<a7j). The latter is particularly 
useful within the framework of functional extreme value theory. 


THE COVARIANCE OF A BIVARIATE SMS Rv 


The D-norm approach offers appealing representations of both the covariance 
of the components of a bivariate SMS rv, as well as their L,-distance. Clearly, 
this result immediately carries over to the components ni, n; of an arbitrary 
SMS rv 7 = (m,---,7a) on R? with D-norm ||:||p. In this case, the D-norm 
corresponding to the bivariate SMS rv (m,n), i < j, is given by 


(2, Wllp,, = B(max(|2| Zi, y| Z;)) = lre: + yelp, ty ER, 


where Z = (Z1, ..., Za) is a generator of ||-||p. 


Lemma 2.5.1 Let n = (m,72) follow a bivariate SMS df. According 
to Theorem 2.3.3 there exists a D-norm ||- ||p on R? with P(n < x) = 
exp(— |x|), £x < 0 € R?. Then, 


oo 1 
E = — dt 
a | Ion 


As E(m) = E(n2) = —1, Var(m) = Var(n2) = 1, the covariance and 
the correlation coefficient of m, n2 are consequently given by 


% j 
Cov(m, n2) = T dt — 1 = p(m, n2). 
0 


2 


The proof of the preceding lemma is based on an ingenious representation 
of general covariances, called Hoeffding’s identity. It shows in particular that, 
in a certain sense, the covariance is a measure of dependence. For the sake of 
completeness, we state it explicitly, along with a proof. 


Lemma 2.5.2 (Hoeffding’s Identity) Let X, Y be square integrable 


rus on R. Then, 


Cov(X,Y) = E(XY) — E(X)E(Y) 


Y)- 
=e T P(X <x,Y <y)— P(X <x)P(Y < y)dz dy. 


Proof. Let (X1, Yi), (X2, Y2) be independent copies of (X,Y). Then, 
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E((X1 — X2)(Yi — Y2)) = 2Cov(X, Y). 


We can write (X1 — X2)(Yi — Y2) as the product of two integrals: 


where we have used the equality 1(z,..)(X1) = 1 — 1(c0,2](X1) etc. As a con- 
sequence, by using Fubini’s theorem, we obtain 


E((X1 — X2)(Yi — Y2)) 
_[ fre (c0,2](X2) — 1(00,2](X1)) (Leou (Y2) — 1o, (¥1))) dz dy 
_ D im Peny ap SP adedi; 


where this time we have used the equality E (1(c0,2](X2)1(co,yj(Y2)) = P(X < 
x,Y < y) etc. This completes the proof. 


Proof (of Lemma 2.5.1). From Lemma 2.5.2 and Lemma 1.2.2, we obtain 
Cov(m, 72) = J J P(m < z,n2 < y) — P(m < £)P(n2 < y) dx dy 
-| J P(m < z,n2 < y) — P(m < x)P(m < y) dz dy 


0 0 
-/ / P(m < 2,72 < y) dv dy — E(m)E(n2) 


0 0 
=| / P(m < 2,2 < y) dx dy —1 


= E(mm2) — 1. 


But 


o 0 
J i P(m <2, 12 < y) dx dy 


fT. 
Ld 


eoa DL 


exp (— ||(#, y)||p) dy dx 
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0 oo 
= -J | rexp (2 |(1,Y)llp) dy de 


T x£ 
=— — exp(z) dz dy 
o Je II(L,v)llp 

i 1 


-o e 
i lao” 


by substituting first y + avy and then x +> 2/||(1,y)||p; we also used the 


equation pes x exp(x) dx = —1. This completes the proof. 


Example 2.5.3 In accordance with the characterization of the inde- 
pendence and complete dependence cases in terms of D-norms, we ob- 
tain for ||-||p = Ilh 


co 1 
c E ren, 
ov(m, n2) (t+ 1)? 


and in the case of ||-|| 5 = Illa 


2 1 
Cov(m, n2) = ih Sy aor 
(mst) = J (max(t, 1) 


In addition to this, by substituting t 4 t!/?, we obtain for a general 
logistic D-norm ||-||,, with parameter p € [1, 00) 


1 


2 ee, 
Cov(m, n2) | TESIA dt 


where 


ae 


1 ee) 
= a—l1 INU = ate ae 
B(x, y) | Ge (lt) = dt | G+ dt, x,y > 0, 


denotes the beta function. (Apply the substitution t > 1/(1 + t) to the 
first integral to obtain the final equation.) 


THE Ly-DISTANCE BETWEEN STANDARD MAX-STABLE RVS 


It is easy to compute the L1-distance between the components of an SMS rv. 
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Lemma 2.5.4 Let n = (m,12) be as in the previous Lemma 2.5.1 and 
let Z = (Zı, Z2) be an arbitrary generator of the D-norm. Then, we 
have the representation 


1 ) _ EZ = 22) 
Dlo) 16D 


As ||(1, 1)||p és greater than one and less than two according to equation 
(1.4), the preceding equation implies the bounds 


E(|Zı — Z2|) 


5 < E(\m — n|) < E(|Z1 — Za). 


The preceding result implies that E(|ņ — 2|) = 0 and, thus, m = 7 
a.s. iff ||(1,1)||p = 1. This coincides with Takahashi’s characterization (see 
Corollary 1.3.2). These arguments also entail that the sup-norm ||-||,, can 
only be generated by a generator Z with identical components a.s., because 
it is only in this case that we have E(|Zı — Z2|) = 0. 


Proof (of Lemma 2.5.4). From the equation 


a+b |b-al 
+ 


2.31 
a (2.31) 


max(a, b) = 
which holds for arbitrary numbers a,b € R, we obtain 


E(\m — m|) = 2E(max(m,2)) — E(m + n2) = 2(1 + E(max(m, 72))). 


From Lemma 1.2.2, we obtain 


0 
E(max(,,2)) = -f P(max(m, n) < t) dt 


_ ay P(n < (t,t) dt 

0 
=- f exp (—[I(E,8)llp) dt 

Pa 1 
=- | exp. Dlo) #= -rD 


by using the substitution t > t/ ||(1, 1)||p and the fact that Seo exp(t) dt = 1. 
This proves the first equation in Lemma 2.5.4. 
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Applying equation (2.31) again, this time to max(Z1, Z2), we obtain 


IG, Dilp = E(max(Z1, Z2)) 
E(Zı + Z2) | E(\Z1 — 22|) 
= DALTA p AL R 
2 2 
E(|Z1 — Z|) 


=1 
= 2 


and, thus, 


E a _oGDllp-1 _ BUZ1 - Zal) 
2(1 To 2 ado 0D 


which completes the proof of Lemma 2.5.4. 


THE RANGE OF THE COMPONENTS OF A MAX-STABLE RV 


The following result extends the upper bound on the L,-distance of the com- 
ponents of a bivariate SMS rv in Lemma 2.5.4 to an arbitrary dimension. 


Lemma 2.5.5 Let n = (m,... na) be an SMS rv on RÌ with corre- 
sponding D-norm ||:||p. We have 


1<i,j<d 1<i<d 1<j<d 


1 1 


< ; 
Rizo |lllp 


B( max m= nl) = E ( max ni — min m) 


Recall that R 1 Xp can be zero, in which case the preceding upper bound 
is infinity and less helpful. 


Proof. From Lemma 1.2.2, we obtain 
0 
e (max m) =- f P (max. <t) dt 
1<i<d =a 1<i<d 
0 
=- f P (n < t1) dt 
r0 


=- exp (— |lt1llp) di 


1 0 1 
--— | exp(t) dt = —-__.,, 
lb Jo Llp 


0 
=- f exp (t ||Lllp) at 
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using the substitution t +> t/ ||1||>. In complete analogy, we obtain 


0 
E ( mio m) =- f P ( miom <t) at 
1<i<d = 1<i<d 
0 
=- f 1- P (min m >t) dt 
= 1<i<d 


0 
al P(n > t1)—1dt 


0 
1 
a= exp (tU 1p) dt = -——_, 
iz ( ) Lip 


using the bound P(n > x) > 1 — exp(-U zp), x < 0 € RI, from 


Lemma 2.4.2 and the homogeneity XV tæ vp = |t| Q s Np. 
Thus, we have established 


1 1 
E ,)-E( min m) < — - ——. 
(msm) (mn) <a nE 


For the Marshall-Olkin D-norm with parameter A € [0,1] as in (1.8), we 
obtain from Example 1.6.4 


Llp, =A+d(1—A), igp, =A, 
and, thus, the upper bound 
1 1 1 1 
v1, lllo, A A+ -A 


It is interesting to note that this upper bound converges to 1/A if the dimen- 
sion d tends to infinity. 


2.6 Max-Stable Random Vectors as Generators of 
D-Norms 


In this section, we pick up an idea in the proof of Theorem 2.3.3 and use max- 
stable rvs as generators of D-norms. As an example, we obtain a generator of 
the logistic D-norm ||-||,,, 1 < p < œ, as in Proposition 1.2.1. 

Let the rv 7 = (m,.-.-,7a) follow the SMS df 


G(æ) = P(n < x) =exp(—llal|p), æ< 0€ R°. 
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Choose c € (0,1). Then, the rv |7;|~° has the df 


1 1 
p (r <1) =p G < inl") 
A £ 


1 
-ep (=). z>0,1<i<d, 


i.e., [m| | follows the Fréchet df F(x) = exp(—a~°), x > 0, with parameter 
a = 1/c; note that P(n; = 0) = 0. Its expectation is, by (2.23), ue = (1 — c). 
The rv 


: 1/1 1 
A = (Z,..., 7) = — E (2.32) 
1 d 
Me \\m| Inal 


now satisfies Z® > 0 and E(Z) =1,1<i<d,ie., Z© is the generator 


i 


of a D-norm. Can we specify it? 


Note that the rv (Imi ete Imal’) follows a max-stable df with Fréchet 
margins: 
1 
H(x) =P ra <ajl<i<d 
ni 


II 
© 
PS 
ge 
l 
s/s 


Now, we specify the D-norm generated by Z“) as defined in (2.32). 
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Breposivicn 2.6.1 The D-norm ||: |p) corresponding to the genera- 
tor Z\°) defined in (2.32), is, for x € R4, given by 


= | z(e) = 1/c 1/6) [|F 
ella =E ( max (leil 2; )) = (a Pease) peas) 


1<i<d 


If m,...7a¢ in the preceding result are independent, i.e., if the underlying 
D-norm ||-|\ 5 is ||-||,, Proposition 2.6.1 yields that Z“ generates the logistic 
norm 


1 1 
lll pe = | (leil"”*,.--leal/”) 


Cc d É 
lje 
|= (3 es = |læll1ye- 
11 


This was already observed in Proposition 1.2.1. 
It is clearly a purely mathematical question, but, nevertheless, an obvious 


c 
and interesting one: what is the limit of | Gik ee Izal) | fore> 1, 
D 


or for c —> 0, if it exists? 
The answer is actually easy: clearly, 


f 1/c 1/e\ E 
tn (etl tea”) = leto. 


On the other hand, from the fact that each D-norm is larger than the sup- 
norm ||-||,, and smaller than the norm ||-||, (see (1.4)), we obtain 


© 1e) 7 
> s, = s = 
= (amy (m) ) = na (a) = lel 


1<i<d 


| (leat. fal”) 
by Lemma 1.1.2. Hence, we have 
lim || (111°, sal) 


Proof (of Proposition 2.6.1). From Lemma 1.2.2, we obtain that, for x > 0 € 


E (ax («.2{°)) = ZE (ma x(=.)) 
1<i<d he iisa Iml 
1 ¢* i 
=— P (max (Se) > 1) a 
He Jo 1<i<d [m| 


1 f% ; 
ees 1=P (max (= =) st) a 
Le Jo 1<i<d Iml? 


= > je) eno [ltlloo 


c 
| = Hels 
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1 se 

== 1-P(.<uisisa)at 
He Jo A 
1 f” 1 ot 

ee 1-P( <4 isisa)a 
Me Jo [nil i 


1 f° a ( | 1 1 
= — exp | — || ———.,...., — 
He Jo (¢/x1)'/° (t/xa)'/° 


)a 
D 


1 oe 1 1/c 1/e 

=— 1- — sxa dt 
He Jo exo ( 1ye G asar ae D 
1 1/c 1/ey||° A 1 

= — ; 1- — dt 
m ( 1 > Ta ) 4 exp fi/e 

: : 1/c 17e 5 
by the substitution t > || (£1 %,..- £4) 2 t. 


The integral {>°1 — exp(—1/t!/°) dt equals E(Y) according to the 
Lemma 1.2.2, where Y follows a Fréchet distribution with parameter 1/c. 
It was shown in (2.23) that E(Y) = ue, which completes the proof. 


ITERATING THE SEQUENCE OF GENERATORS 


Taking this new D-norm ||||p in (2.33) as the initial D-norm ||-|| and 
proceeding as before leads to the D-norm 


e2 


? 


2 2 
Jenassa a r 


x > 0 € Rt. We can iterate this procedure and obtain in the n-th step 


1/e” i/e \ Je” 
lies Pallpes = (er eaa 


oe 
This begs the question: Does this sequence of D-norms converge? 
Note: if we choose ||-||, = ||-||,,, then we obtain, for x > 0 € R?, 


1/c” 1/c” 
Wea”) 


This may raise the conjecture that the sequence of D-norms converges to the 
sup-norm ||-||.., if it converges. This is actually true and can easily be seen as 
follows. 

Recall that ||-||,5 < |l-llp < lkl; for an arbitrary D-norm and that c € 
(0,1). As a consequence, we obtain 


c” 7 1ye” c 7 E 
p (2a ae (£1; -3 £a)lloo + 


n ny Jo" 
ecean e ma) 


D 
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>n oo I (Gass ee 2a)lles, x>0eER?, 
according to Lemma 1.1.2; hence, 


(E1; -+ £a)ll oe noo ||(@1,--+ alloc - 


® 


Check for 
updates 


3 


Copulas & Multivariate Extremes 


This chapter reveals the crucial role that copulas play in MEVT. The D-norm 
approach again proves to be quite a helpful tool. In particular, it turns out 
that a multivariate df F is in the domain of attraction of a multivariate EVD 
iff this is true for the univariate margins of F together with the condition 
that the copula of F in its upper tail is close to that of a generalized Pareto 
copula. As a consequence, MEVT actually means extreme value theory for 
copulas. 


3.1 Characterizing Multivariate Domain of Attraction 


In complete analogy to the univariate case in (2.2), we say that a multivariate 
df F on Rê is in the domain of attraction of an arbitrary multivariate EVD 
G, again denoted by F € D(G), if there are vectors an > 0, bn E R, n EN, 
such that 

F” (ana + bn) —ns0 G(x), x eR’. (3.1) 


Recall that all operations on vectors are meant componentwise. See Theo- 
rem 2.3.4 for the family of possible limits G. 


SKLAR’S THEOREM 


A copula is a multivariate df with the particular property that each univariate 
margin is the uniform distribution on (0,1). For an exhaustive account of 
copulas we refer to Nelsen (2006). Sklar’s theorem plays a major role in the 
characterization of F € D(G) for a general df F on Rê. 
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Theorem 3.1.1 (Sklar (1959, 1996)) For every df F on R? with 
univariate margins F\,...,Fq there exists a copula C such that 


F(x) = C(Fi(21),..., Fa(xa)), £ = (ap, sang Za) E RÊ. 


If F is continuous, then C is uniquely determined and given by 
C(u) = F(FT' (u), ..., Fr (ua), w= (u, ---, ua) € (0, DŽ, 


where F+ (u) = inf{t € R : F;(t) > u}, u € (0,1), is the generalized inverse 
of F;. The copula of an rv Y = (Yj,..., Yu) is meant to be the copula of its 
df. 

If Y = (%4,..., Ya) is an rv such that, for each i € {1,...,d}, the df F; 
of Y; is in its upper tail a continuous function, then the copula C of Y is, for 
u = (u1, ... uq) close to 1 € R%, uniquely determined and given by 


C(u) = P(Fi(V1) < u,..., Fi(Ya) < ua). 


Sklar’s theorem can be formulated in terms of rvs as follows. Let Y = 
(Y1,..-, Ya) be an arbitrary rv and denote by F; the df of Y;, 1 < i < d. There 
exists an rv U = (U;,...,Ua), which follows a copula, i.e., each U; follows the 
uniform distribution on (0,1), such that 


Y =p (FT iin Fy (Ua)). 


We frequently use this version. 


INTRODUCING GENERALIZED PARETO COPULAS 


By (2.15), the copula C of an rv Y = (Y1,..., Ya), which follows an arbitrary 
multivariate GPD as in (2.17), with corresponding D-norm ||-|| satisfies the 
equation 


C(u) = P (Ya, (Y1) +1,---, PYaa(Ya) +1) < u) 
= P(Wi < w — 1,..., Wa < ua — 1) 
=1-||(1—m,...,1-uallp, Up <uUu<1ER%, 


for some Uo < 1 € R¢. A copula C on R? with such an expansion 
C(u)=1-|1-ullp, wuw<u<1eR’, (3.2) 


for some uo < 1, is called a generalized Pareto copula (GPC). These copulas 

turn out be a key to MEVT; see, for example, Proposition 3.1.5 and 3.1.10. 
Note that any marginal distribution of a GPC C is a lower dimensional 

GPC as well: if the rv U = (U1,..., U4) follows the GPC C on R4, then the 
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tv Ur := (Uj,,...,Ui,,) follows a GPC on R!™! for each non-empty subset 
T = {i1,...,4m} C {1,..., a}. 

The characteristic property of a GPC is its excursion stability, as for- 
mulated in the next result. The conclusion “=” follows from the proof of 
Lemma 3.1.13 later in this chapter. The reverse implication is just a reformu- 
lation of Falk and Guillou (2008, Proposition 6). 


Proposition 3.1.2 Let the rv U = (Uj,...,Ua) follow a cop- 
ula C. Then, C is a GPC iff for an arbitrary non-empty subset 
) is exceedance 


Te iem) Of (lad) the rur = (Uys arn, Ue 
stable, i.e., 


P(Up >1-—tu)=tP(Urp>1—u), +t€[0,1], (3.3) 


for u close to 0 € R”. 


If P(Ur > 1-— u) > 0, then (3.3) clearly becomes 
P (Ur > 1 -— tu | Ur > 1-u)=t, t € [0,1]. 


But P(Ur > 1 -— u) can be equal to zero for all u close to 1 € R™. For 
example, this is the case when the underlying D-norm ||-|| is ||-||,. Then, 
V- Up = 0 (see equation (1.12)), and thus, P(Ur > 1 — u) = 0 for u close to 
0 € R”, unless m = 1; see Lemma 3.1.13. 

Different from the definition of the family of univariate GPD as in 
(2.7), the definition of multivariate GPD is not unique in the literature. 
There are different approaches (see, e.g., Rootzén and Tajvidi (2006); Falk 
et al. (2011) or (2.17)). The following suggestion may help to conclude 
the debate. 


Remark 3.1.3 Proposition 3.1.2 offers another way to define an arbi- 
trary multivariate GPD:. An rv Y = (Yj,..., Ya) follows a multivariate 
GPD iff its copula is excursion stable, in which case it is a GPC, and 
each component Y; follows a univariate GPD in its upper tail as in (2.7). 
Note that this coincides with the definition of a multivariate GPD as in 
(2.17). 


Example 3.1.4 Let X = (X1,...,Xa) be an rv, whose corresponding 
copula C is a GPC, and let each component follow the standard Pareto 
distribution, i.e., P(X; < x) = 1 — x7! =: F(x), x > 1. Note that 


1 ~ F(ta) = 5(1 — F(a), yee lle 


By Sklar’s theorem 3.1.1, we can assume the representation 


B= E dara (6h) 
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where U = (Uj,...,Ua) follows the copula C. For æ = (z1,..., £a) 
large enough, we obtain 
P(X > ta) 
P(X >a) 
_ P(U; > F(tai), 1<i<d) 
PUR h(a) Laars d) 
_ Pils 21 — (= Fite), Lat = a) 
PU >1-(—F(m)), 1<i<d) 
P (U; >1-4(1— F(x)),1<i<d) 


P(X >ta|X>2)= 


t 


P(U;>1—-(—F(a)), 1<i<d) 


; t21, 


1 
t 


by equation (3.3), provided P(U > u) > 0 for all u € (0,1)? close 
to 1 € R¢. The preceding result can easily be extended to arbitrary 
univariate generalized Pareto margins as given in (2.7). 


The previous result shows that if one wants to model the copula of multi- 
variate exceedances above high thresholds, then a GPC is a first option. 


DOMAIN OF ATTRACTION FOR COPULAS 


The df of the uniform distribution on (0,1) is H(u) := u, u € [0, 1]. We obtain, 
therefore, with an = 1/n, bn = 1, n € N, for x < 0 and large n 


H” (anx +bn) = H” (= + 1) = (1 + z) noo exp(z), 


i.e., each univariate margin of an arbitrary copula is automatically in the 
domain of attraction of the univariate SMS df G(x) = exp(x), x < 0. 

The following conclusion is a consequence: if a copula C on R? is in the 
domain of attraction of an EVD G, 


g” (1 + =) n+ G(x), £<0€R’, 


then G has necessarily standard exponential margins, i.e., G is a (multivariate) 
SMS df. According to Theorem 2.3.3, there exists a D-norm ||-||,, such that 
G(x) = exp(—||x||,5), x < 0 € R*. This underlines the particular role of SMS 
df. The next result characterizes the condition C € D(G). It turns out that 
C € D(G) iff its upper tail is close to that of a GPC. 
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Proposition 3.1.5 A copula C on R? satisfies C € D(G), where 
G(x) = exp(—||z||p), x < 0 € RI, iff the copula C satisfies the ex- 
pansion 


C(u) =1—|]1- ally + of — ull) 


as u — 1, uniformly for u € [0,1]¢. 


The uniformity in the preceding result is meant as follows: For each € > 0, 
there exists ô > 0 such that 
|C(u) - (1 = |]1 = ull) 


<e, if well—6,1]?, uw £1. 
lial | 


The norm ||-|| in the denominator can be arbitrarily chosen, due to the fact 
that all norms on R? are equivalent. 

As an example, we show in Corollary 3.1.15 that an Archimedean copula 
Co on Rf, whose generator function ¢ satisfies condition (3.11) below, is in 
the domain of attraction of an SMS df with corresponding logistic D-norm. 


Proof (of Proposition 3.1.5). The implication “<=” is obvious: we have for 


z<O0cR? 
xz\nr 1 iy" 
c(1+=) = (1- + jel, +0(=)) 


noo exp(— lællp) =: G(x), 


where G(-) defines a standard max-stable df by Theorem 2.3.3. 
Next, we establish the implication “=.” Suppose that C € D(G). We have, 
consequently, 


cn (1 A zy >n G(£), 


where the norming constants are prescribed by the univariate margins of C. 
Repeating the arguments in the proof of Theorem 2.3.3, one derives from the 
above convergence 


c (1+2) >o Gla), ser’. 


As the limiting df G(æ) = exp(- ||æ||p), x < 0 € Rf, is continuous, the above 
convergence is uniform in g, i.e., 


T 
sup |C" (1+ =) - exp (- lælip)| ee 
xr<0 t 


Taking logarithms, this implies for t > 2 


x 
sup [tiog (c (1 + =)) + lællp] >to 0. 
—1<æ<0 t 
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The Taylor expansion log(1 + £) = € + O(c?) for e > 0 yields uniformly 
for —1 <æ <0 €R’ andt>2 


log (c (1+2)) = log (1 ! (ea ! =) 1)) 
=c(1+2)-140((c(1+2)-1)), 


The lower Fréchet bound for a multivariate df (see, for example, Galambos 
(1987, Theorem 5.1.1)) for æ = (a1,...,2a) < 0 provides the inequality 


d 
x 1 


If we have in addition —1 < x < 0, this yields 


Ca) <8 


As a consequence, we obtain 


sup (eG +F) -1) + lela] > 0. 


Putting u := 1 + x/t, the preceding equation becomes 


sup |t(C(u) — 1) + ¢||1 — ull) | 
1-41<u<1 


=t sup |C(u)-(1- |1- ull») 
1-41<u<1 


=: r(1/t) >t% 0. (3.4) 


Choose u € [0,1]¢ with ||1— ul], < 1/2. The preceding equation with 
t:=1/||1— ull, implies 


lC) — = |]1 = ullp)| 


< r(t = ull); (3.5) 
lt — ull 
note that we can apply (3.4) with these choices of u and t, since 
1 1- u 
1—-1<u<1 =} 0< —m — <1, 
t I1 — ul]. 
which is obviously true. 
Equation (3.5) implies for u € [0,1], u 4 1, 
IC(u) - A= || = ull p) 
yy 0 0, (3.6) 


[1 — ull. 
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which is the expansion 
Clu) =1—|]1— ally + o(|1 — ull.) (3.7) 


as u —> 1. 

As described above, uniformity in u in the above expansion means that 
for all € > 0, there exists 6 > 0 such that the remainder term satisfies 
lo(||1 — ul|.,)| < € if |1 — ull, < 6. We prove this by a contradiction. Sup- 
pose this uniformity is not valid. Then there exists €* > 0 such that, for all 
6 > 0, there exists us € [0, 1]? with ||1 — usl| < 6 and |o(||1 — us||,,)| > e*. 
But this clearly contradicts equation (3.6). 

Since all norms on R? are equivalent, the remainder term o(||1 — ul|,,) in 
expansion (3.7) can be substituted by o(||1 — u||) with an arbitrary norm on 
R?. This completes the proof of Proposition 3.1.5. 


The following consequence of Proposition 3.1.5 provides a handy charac- 
terization of the condition C € D(G). 


Corollary 3.1.6 A copula C on R? satisfies C € D(G), with G(x) = 
exp(— ||a||,), x <O€ R4, iff for alla < 0 € Rf, the limit 


mia Cl + tx) 


li 


i 7 (3.8) 


exists in [0, 00). In this case £(x) = ||x|| p. 


The limit ¢(-) is also known as the stable tail dependence function of C 
(Huang (1991)). The fact that each stable tail dependence function is actu- 
ally a D-norm opens the way to estimating an underlying D-norm by using 
estimators of the stable tail dependence function. 


Proof. We know from Proposition 3.1.5 that the condition C € D(G) is equiv- 
alent to the expansion 


C(u) =1—|[1— ullp + o(||1 — ull) 


as u converges to 1, uniformly in [0,1]?. For æ < 0 € R? and t > 0, this 
readily implies 


1—C(1 + ta) = |ltx|[p + o(||tall) = tllællp + off) 


and, thus, the implication “=.” 

Suppose next that the limit in equation (3.8) exists. The Hoeffding— 
Fréchet bounds for a multivariate df (see, for example, Galambos (1987, The- 
orem 5.1.1)) imply the bounds 


d 
> 0 +ta;)-d+1<C(1+ta) < min (1 +4 tzi) 
= 1<i<d 
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thus, 


d 
pa |x;| > 1- C(1 + ta) > t max |æ:| . 
za 


The limit L(x) therefore satisfies 
læll < &æ) < æl, æ <0 € Rô. (3.9) 


This implies in particular that L(x) +2-,0 0 and ¢(a) — oo if one component 
of x decreases to — o0. 

The Taylor expansion log(1 +£) = € + O(c”) for e — 0 implies for æ < 0 € 
R? and n € N large, 


o (1+) =ef (e (1+ 8) 


Note that 


on (1+2) =P (n( max U® -1) <e), neéN, 
n 1l<i<n 


is a sequence of df, which converges by the above result to exp(—¢(a)) =: 
G(x), x < 0 € R?. From Helly’s selection theorem (see, for example, Billings- 
ley (1999)) and equation (3.9), we conclude that G(-) defines a df on Rt. We 
have to establish its max-stability: for m € N, we have 


La Ga) 


=i (om (E) cre 


or 
G(=)"=c@), xr<0eER?, 
which is the max-stability of G. Finally, we obviously have according to (3.9) 
L(ze;i) = |z], z<0,1<i<d, 


and thus, G has standard negative exponential margins. Theorem 2.3.3 implies 
G(x) = exp(- l||æl|p), £ < 0 € R?, which completes the proof. 
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Remark 3.1.7 Equation (3.9) in the preceding proof reveals why ||-||, 
and ||-||, are the smallest and the largest D-norms: this is actually due 
to the Hoeffding—Fréchet bounds for a multivariate df. 


Example 3.1.8 The Ali-Mikhail-Haq family of bivariate copulas is de- 
fined by 


fst 


C(u,v): 0< u,v <1, ve [-1,1]; 


see Nelsen (2006, Section 3.3.2). It satisfies the equation 


1— C(u,v) l-u l-v l1-—ul-v 
ace 1-9 
C(u, v) u 2 v a ) u v 


As a consequence, we obtain for z1, £2 < 0 and t > 0 small enough 


1 Cy(1 + ¢%1,14 tx2) 

t 
= Cy(1 + tzı,1 + tx2) —tz, i —txro Ay ) —tx, —trə 
t 1l+tz, 1+tae 1 + tzı 1 + tae 
ro [£1] + |x|. 


In this case, we obtain (£1, x2) = ||(x1, z2)||; and, thus, Cy € D(G), 
G((x1,£2)) = exp(— || (z1, v2)||,), z1, £2 < 0, which has independent 
components. 


EXTREME VALUE COPULAS 


An extreme value copula on R¢ is the copula of an arbitrary d-variate max- 
stable df G*. According to Theorem 2.3.4 for u € (0, 1], it has the represen- 
tation 

Cas (u) = exp (— ||(log(u1), ---,log(ua)) |p) - (3.10) 


On the other hand, the right-hand side of this equation defines an extreme 
value copula for any D-norm ||-|| 5. This is a consequence of Theorem 2.3.3. 


Example 3.1.9 The bivariate Hiisler—Reiss distribution with parame- 
ter A > 0 as given in Example 2.3.6, i.e., 


Gur, (£, y) = exp (- exp(—2)® (a J5 i) 


eo 
exp( ye (a+ F o z,y ER, 
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has according to equation (3.10) and Lemma 1.10.6, where the D-norm 
is explicitly given, the copula 


Cur, (u, v) 


= exp (lostwye (a PECE T80) 


+log(v)® Q i susdit j; , u,v € (0,1). 


2A 


For a discussion of parametric families of extreme value copulas and their 
statistical analysis we refer the reader to Genest and Nešlehová (2012). 


DOMAIN OF ATTRACTION FOR A GENERAL DF 


The next result goes back to Deheuvels (1984) and Galambos (1987). Instead 
of the D-norm expansion of Cr(u) below, they use in their original formula- 
tion the limit of C” (ut/”) as in Corollary 3.1.12. 


Proposition 3.1.10 A d-variate df F satisfies F € D(G) iff this is 
true for the univariate margins of F, together with the condition that 
the copula Cr of F satisfies the expansion 


Cr(u) =1—|[1—allp +0(]1 — ull) 


as u —> 1, uniformly for u € [0,1]%, where ||-|| 5) is the D-norm on R$, 
which corresponds to G in the sense of Theorem 2.3.4. 


A consequence of the preceding result is that multivariate extreme value 
theory actually means extreme value theory for copulas. 


Proof. Suppose that F € D(G). According to Theorem 2.3.4, G can be rep- 
resented by a D-norm ||:||p together with the functions w;(z) = log(Gi(z)), 
1 < i < d, where G; denotes the i-th univariate margin of G. In particular, 
we have that each univariate margin F; of F satisfies F; € D(G;), i.e., 


Taking the logarithm on both sides and applying the Taylor expansion log(1+ 
£) = £ + O(e?) for e 0, one obtains for x € R with G;(x) > 0 


or 


(Fi (anix + bni) — 1) >n log(Gi(x)) = wi(a). 
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Using Sklar’s theorem and repeating the preceding arguments, we obtain 
F” (ana + bn) Sns0 G(x) 
<> CB ((Fil@nias + bni) 41) >n G(x) 
=> CF ((: q MUR (aniti + bri) — D) ) >n G(T) 


n 


=> Cf ((: + ne) ) nc G(x) 


> OB (14 Ea) Prom @ (UOD) =e (- ooi) 


i= 


for (y1,..., Ya) < 0 € R? according to Theorem 2.3.4. Proposition 3.1.5 now 
has the implication “=” in Proposition 3.1.10. 

The reverse implication is easily seen: for = (x1,..., £q) with 0 < 
Gi(a;) <1, 1<i<d, we have 


F” (a,x + bn) 


= cp ( (4 Earth)" ) 


= (: = = |(0(Fi(anit + bni) — Dial, = D) 


noo EXp (— \| (a4 (x1), bane , Wa(£a)) lp) 
= G(a1, sae sa) 


according to Theorem 2.3.4. 


Remark 3.1.11 The original formulation of the preceding charac- 
terization by Deheuvels (1984) and Galambos (1987) is as follows: 
F € D(G) iff this is true for the univariate margins of F together with 
convergence of the copulas: 


Op (u) noo Celu) =G ( (Cr 0) ka): 


u = (u1,..., uq) € (0,1)%, where G; denotes the i-th margin of the 
general EVD G, 1 < i < d; see the following Corollary 3.1.12. 


Note that C% (ul/”) is the copula of maxı<i<n U“), where U®),U®),... 
are iid rvs that follow the copula Cp: Put 


H(u) := P (max U® < u) = C} (u). 


l<i<n 
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Each univariate margin of H is 


Hju) = P ( pax uP <u) =u", u € [0,1], 


I<i<r 


thus, the copula Cy corresponding to the continuous df H is 
=j =l n 1/n 1/n 
Calta, ..2jta) = H (H7 (u), ..., H3 (ua)) = CB (uj teg tly ie 


u = (u1,.-.-, ua) € [0, 1]¢. 
The next result completes the list of characterizations of a copula that 
belongs to the domain of attraction of a max-stable distribution. 


Corollary 3.1.12 A copula C on R°? satisfies C € D(G), G(x) = 
exp(— ||a||,), x <0 €R?, iff for any u € (0,1)? 


O” (ull) n> Calu) = G ( (G7) p1) = epl- llloglu)llp), 


where log(u) is meant componentwise. 


Proof. We know from Proposition 3.1.5 that the condition C € D(G) is equiv- 
alent to the expansion 


C(u) =1- |1- ullp + o(l|1 — ull) 
as u — 1, uniformly for u € [0, 1]%. As a consequence, we obtain 
on (w) = (1- i= I 
Choose u; = exp(z:), zi < 0, 1 < i < d. Then, the Taylor expansion 1 — 


exp(e) = —e + O(e?°) as e > 0 implies, with æ = (x1, ..., £q) and exp(æ) also 
meant componentwise, 


e (er) =e (E) 
eaaa a a 
EDRO 

iele) 


no exP(— ||2l| p) 
= exp(— ||log(u)|| p), 


+o(|1- ul 
D 


which completes the proof of the implication “=.” 
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Suppose on the other hand that, for u € (0,1), 
o” (u) n Calu) = exp(- ||loglu)||p). 
With u := exp(æ), x < 0 € R?, we obtain 
c” (exp (=)) noo expl- lle llp): 


Writing exp(#/n) = 1+a/n+ o(1/n), this becomes 


on (1 £2 po (+)) nace exP( llællp). 


on (1+2+0(=)) =" (1+ =) +001) 


as n — oo, which follows from the general bound 


But 


d 


for an arbitrary df F on R? with univariate margins F;, (see, for example, 
Reiss (1989, Lemma 2.2.6)), together with the fact that the copula C has 
uniform margins. We have thus established 


P T — 
C (1 + =) >n=> exp(— ||2||p) = G(x), 


x <0 € Rt, which completes the proof. 


EXPANSION OF SURVIVAL COPULA VIA DUAL D-NORM 
FUNCTION 


If a copula C satisfies C € D(G), G(x) = expl- ||æl|p), x < 0 € Rf, the 
survival function of C can be approximated by the dual D-norm function 
corresponding to ||:||p- This is the content of our next result. 

We have to clarify some notation first. Let ||-|| be an arbitrary D-norm 
on R? with generator Z = (Z1,..., Za). Choose a non-empty subset T C 
{1,...,d}, ie., T = {41,...,im} with 1 < i1 < --- < im < d, m = |T|. Then, 


m 
lullo, = |X wye] =E ES (ly; Z) ,  yeR”, 
j=l . 


is a D-norm on R™. It is the projection of ||-||5 onto ||-||p,,- Its dual D-norm 
function is 
Ry lp, = E ( min (|y;| Z) ; y eR”. 


1<j<m 
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Lemma 3.1.13 Let G be an SMS df with corresponding D-norm ||-|| p 
on R¢, and let U = (U;,...,Ua) be an rv that follows a copula C. Then, 
we have C € D(G) iff for every non-empty subset T = {i1,...,im} C 


at 


P(U;, > uj, 1<j<m) =U1-— utp, + 0(||1 — ull) 


as u = (1,...,Um) > 1 € R”, uniformly for u € [0,1]™. 


The proof of the preceding lemma shows that, for every T = {i1,..., im} C 

{1,...,d}, 
P (U > uj, 1 <j <m) =U1-— upr 

for u close to 1 € R”, if C is a GPC. 

The uniformity condition on u in the preceding result can be dropped for 
the reverse implication “<=.” 

Note that the survival probability P(U > u) of an rv U that follows a 
copula C, also known as a survival copula, is not a copula itself. 


Proof. We first establish the implication “=.” We can assume wlog that T = 
{1,...,d}. From Proposition 3.1.5, we obtain the expansion 


Clu) =1—|1 — ullo + o1 — ull) 


as u — 1, uniformly for u € [0,1], if C € D(G), G(x) = exp(—|a||p), 
x <0 € Rt. The inclusion—-exclusion principle in Corollary 1.6.2 implies 


P(UU >1-v) 
d 
-1-P (Üs = 
i=1 
=1- So (~I) PU; < 1-v,ieT) 
OAT C{1,...,d} 
=1- 5 (-1)7-? (:- S viei +o( X viei )) 
OAT C{l,...,d} iET D iET 
= >> p | X veil] + ollel) 
OAT C{I,...,d} iET D 


as v + 0, uniformly for v € [0, 1]%; recall that, by equation (1.10), 


yo Gps. 


ØZTC{1,...,d} 
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Choose a generator Z = (Z1,..., Za) of ||-|| p>. From Lemma 1.6.1, we obtain 
5 <1)" X viei 
OAT C{1,...,d} ieT D 
= YO Cate (max(Z)) 
OATC{I)..d} i 
=Ef E (OD marla) 
OATC{I,..d} i 
=E ( min (2) =RvNp. 
1<i<d 


Replacing v by 1 — u yields the assertion. 
The reverse implication can be seen as follows. Choose x > 0 € R? and 
s > 0. The inclusion—-exclusion principle in Corollary 1.6.2 implies 


1— C(1 — sax) 
s 
s 
P (Ui {U;>1- s) 
oe 
_ y (ait PU > 1- szr;, i €T) 
O4TCHI,...,d} s 
>a 2, CDHE (snin(e:2%)) 
OAT C{I,...,d} 


OAT C{Il,...,d} 


( max (e.2:)) = lel 


and thus, C € D(G) by Corollary 3.1.6. 


The following example is established in Charpentier and Segers (2009, 


1/p 
Theorem 4.1). With |æ], = (T Izl”) , p € [1,œ], we denote again 
the family of logistic norms. These are D-norms, as seen in Section 1.2, with 
limp>oo ||x||,, = læll 2; see Lemma 1.1.2. 
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Example 3.1.14 Take an arbitrary Archimedean copula on R 


Colu) = gt (plu) +--+ + plua)), 


where y is a continuous and strictly decreasing function from (0, 1] to 
[0,co) with y(1) = 0 (see, for example, McNeil and NeSlehova (2009, 
Theorem 2.2)). Let U = (Ui,...,Ua) follow this copula Cy. Suppose 
that 


(3.11) 


exists in [1,00]. Then, for æ > 0 € Rt, the survival copula satisfies 


. P(U; > 1-—szr;1<i<d) 
lim == 
s40 S 
wru =O, ifp=1, 
= (ULM, ifl<p<o, 


NE Noo = MiNI Tien Tat i p= oo. 


If p = 1, then the margins of C, are tail independent. This concerns 
both the Clayton copula and the Frank copula with generators y(t) = 
(t> — 1)/A, A > 0, and y(t) = — log((exp(—At) — 1)/(exp(—A) — 1)), 
A € R\ {0} respectively, but not the Gumbel copula with generator 
palt) = (—log(t))*, A > 1, in which case p = À. 


The preceding example gives rise to the conjecture that Cy € D(G,) under 
condition (3.11). This conjecture can easily be established. 


Corollary 3.1.15 Let C, be an arbitrary Archimedean copula on R? 
with generator y that satisfies condition (3.11). Then, Cy € D(Gp), 


where Gp is the standard maz-stable df with D-norm ||-||,, p € [1, x]. 


Proof. Suppose that the rv U = (Uj,...,Ua) follows the Archimedean cop- 

ula Cy. The distribution of an arbitrary subset (U;,,...,Uj,,), m < d, with 

different indices, is an Archimedean copula as well, but this time on R™: 
P (U; < Uiys+: sa Uin < Ui, ) 

= P (U, Sr Ube N pe < Uim; U; < T, 1<i< d) 


= 9" (Yuin) +++ Pin) 


as y(1) = 0. Since condition (3.11) does not depend on the dimension d, the 
preceding Example 3.1.14 also entails that, for x = (@1,...,%m) > 0 € R”, 


r P (Ui, 2 1 — sz1,..., Uin > 1— sam) 
lin =< sa 
sO S 
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Raw =0, if p= 1, 
= UE p, if 1< p< o0, (3.12) 
VTL = min {z1,..., £m}, if p=, 


where these dual D-norm functions are defined on R™. Lemma 3.1.13 now 
implies the assertion. o 


Example 3.1.16 (Continuation of Example 3.1.14) Let C, be an 
Archimedean copula on R?, whose generator function y : (0, 1] > [0, 00) 
satisfies with some so € (0, 1) 


= = =p; SE (0, So], (3.13) 


with p € [1, 00). Then, C, is a GPC, precisely, 
C(u) =1- |1- ull, u € [1 — so, 1}, 


where ||-||,, is the usual logistic norm; see Proposition 1.2.1. 
This is readily seen as follows. Condition (3.13) is equivalent with 
the equation 


(log(eA—s)) =Z, s € (0, so]. 


Integrating both sides implies 


log(p(1 — s)) — log(y(1 — so)) = plog(s) — plog(so) 


(38) oma ((S))> eto 


which yields 


i.e., 
p(s) =c(1—s)?, s € [1 — so, 1], 


with c := (1 — so)/s$. But this implies 
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THERE ARE STRICTLY MORE D-NORMS THAN COPULAS 


Let the rv U = (U1,..., Ua) follow a copula, i.e., each component U; is uni- 
formly distributed on (0,1). Since E(U;) = fo udu = 1/2, the rv Z := 2U 
generates a D-norm. 

Sklar’s theorem 3.1.1 may promote the idea that every D-norm can be 
generated this way. But this is not true. Take, for example, d = 2 and 
ll(z,y)||, = |z| + ly|. Suppose that there exists an rv U = (U1, U2) follow- 
ing a copula such that 


(zy), = 2E (max(|a|Ui,|y|U2)),  z,y ER. 
Putting x = y = 1, we obtain 


J= 2B(max(Uh, U2)), 


or 
B(1 — max(Uh, Ua) ) =0 
——S ee” 
€[0,1] 
and, thus, 
P(max(U1, U2) = 1) =i: 
But 


P(max(U1, U2) = 1) = PUY, = yu {U2 = 1}) 
< P(U; = 1) + P(Uz = 1) = 0. 


Moreover, it is obvious that ||-||, on R? with d > 3 cannot be generated 
by 2U, as ||(1,...,1)||, = d > 2E (maxı<i<a Ui). There are, consequently, 
strictly more D-norms than copulas. (Note that this is not meant in a strict 
mathematical sense.) 


3.2 Multivariate Piecing-Together 


It is by no means obvious to find a copula C that does not satisfy F € 
D(G) for some SMS df G. Counterexamples are provided in Section 3.3. As a 
consequence of the considerations in Section 3.1, we obtain that a copula C(u) 
can reasonably be approximated for u close to 1 € R? only by 1— ||1 — ul| p, 
with some D-norm ||-|| p. 

This message has the following implication: if you want to model the copula 
underlying multivariate data above some high threshold uo, a GPC is a first 
option, given in its upper tail by 


Q(u) =1-|/1-ull,, uo <u<1eER?. 3.14 
D 
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This idea is investigated in what follows. It turns out that it is actually possible 
to cut off the upper tail of a given copula C and to impute a GPC Q in such 
a way that the result is again a copula. 

Note that 


Õ(u) := max(1—|[1-ully,0),  O<Sust, 


defines a copula only in dimension d = 2; for details, we refer the reader to 
Falk et al. (2011, Section 5.2). 


UNIVARIATE PEAKS-OVER-THRESHOLD APPROACH 


As shown in (2.6), the upper tail of a univariate df F can reasonably be 
approximated only by that of a GPD, which leads to the (univariate) peaks- 
over-threshold approach (POT): For a univariate rv X with df F, set 


F(a) — F(2o) 


Flol(¢) = P(X < z | X > zo) = I Fe 
=P as 


XL > Xo, 


where we require F (xo) < 1. The univariate POT is the approximation of the 
upper tail of F by that of aGPD H 


F(x) = (1 — F(ao)) FP] (x) + F(xo) 
por (1— F(#o))Ha,p,o(@) + F (Xo), © > Xo, 


where a, u, and o are shape, location and scale parameters of the GPD 
H respectively. Recall that the family of univariate standardized GPDs is 
given by 


The preceding considerations lead to the univariate piecing-together ap- 
proach (PT), by which the underlying df F is replaced by 


* = F(z), £ < Xo, 
Fo (2) = -oa o, (3.15) 


typically in a continuous manner. This approach is aimed at an investigation 
of the upper end of F beyond observed data. Replacing F in (3.15) by the 
empirical df of the data provides in particular a semiparametric approach to 
the estimation of high quantiles; see, for example, Reiss and Thomas (2007, 
Section 2.3). 
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MULTIVARIATE PIECING- TOGETHER 


A multivariate extension of the univariate PT approach was developed in 
Aulbach et al. (2012) and, for illustration, applied to operational loss data. 
This approach is based on the idea that a multivariate df F can be decomposed 
by Sklar’s theorem 3.1.1 into its copula C and its marginal df. The multivariate 
PT approach then consists of the two steps: 


(i) The upper tail of the given d-dimensional copula C is cut off and sub- 
stituted by a GPC in a continuous manner, such that the result is again 
a copula, called a PT copula. Figure 3.1 illustrates this approach in the 
bivariate case: the copula C' is replaced in the upper right rectangle of 
the unit square by a GPC Q; the lower part of C is kept in the lower left 
rectangle, whereas the other two rectangles are needed for a continuous 
transition from C to Q. 


(ii) Univariate df Fy',..., FZ are injected into the resulting copula. 
(0, 1) (1,1) 
Q 
| C 
(0, 0) (1, 0) 


Fig. 3.1: Multivariate piecing-together. 


Taken as a whole, this approach provides a multivariate df with prescribed 
margins F*, whose copula coincides in its lower and central parts with C and 
in its upper tail with a GPC. 
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Let U = (U1,...,U4a) follow an arbitrary copula C and V = (Vi,..., Va) 
follow a GPC. Let Z be a generator of the corresponding D-norm ||-||p. We 
suppose that U and V are independent. 

Choose a threshold u = (u1,..., ua) € [uo, 1] and put for 1 <i < d 


The rv Y = (Y1, ..., Ya) actually follows a GPC; the following result provides 
a precise characterization of the corresponding D-norm as well. 


Theorem 3.2.1 Suppose that P(U > u) > 0. The rv Y defined 
through (3.16) follows a GPC, which coincides with C on [0, u] C [0, 1]%, 
and whose D-norm is given by 


= 1(U; > uj) d 
lelo =£ (max, (lo) ) were 


Uj 


where Z and U are independent. 


Note that Z := (Zi,...,Za), with Z; := Z;1(U; > u,;)/(1 — uj) is a 
generator of a D-norm, due to the independence of Z and U. As X} := 1(U; > 
u,;)/(1 — uj) is non-negative and has expectation one, X = (X1,..., Xa) is 
the generator of a D-norm itself, and consequently, Z is the generator of the 
product of the D-norms, as investigated in Section 1.9. 


Proof. Elementary computations yield 
P(Y; <2)=42, 0<2<l, 


i.e., Y follows a copula. We have, moreover, for O < a < u, 


P(Y <a) 

= 5 P (Y <a; Ur < urk € K; Uj > ujj €K®) 
KC{1,...,d} 

=- 5 P(UAU: < ui) + (ui + (1 — uv; > ui) < 241 <i < d: 
KC{1,...,d} 


Up < ukk © K; U; > uj,j € K?) 
= C(x) 


and, for u < æ < 1, 
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= es P (Y < x; Us < uk € K; U; > ujj € K?) 


= 5 P (U Sun, k € K; uj + (1 — uy)Vj < 03,0; > uj, j € K?) 


II 
rM 
Y 


eae: tj- Uy. of 
(Us < uk € Ki; > ujj EK) P (v < B= 4 je x?) 
a 


= E ( utr. <u) ( T 10>) 
KC{1,...,d} kek jeKe 


«P(ysS=4jen®), 
1l—u; 


If x < 1 is close enough to 1, then we have, for Ke # Ô, 


p(ys 


and, thus, using the independence of U and Z, 
P(Y < x) 


=1- © e((1 TEN) (mx 1(U; >u) 
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1-5 y , (11 TEN) (1 1U; >w) 


jEKe 


Jj — 1| 
x ax, ( = Z;j1(U; > uj) 


x< 5 , (11 1(Ux <w) (1 1(U; 29) 


KC{1,.; kek jEKES 
Kozo 
LU. i 
=1-— E| max [ |r; -1|Z ee) 1-—1(U; < uj, 1< j <d)) 
j i= J> uj 
j 


where we have used the identity 


: m (11 (Uk < w) ( I] 14> w) = 


Ke {less kek jeKe 


This completes the proof of Theorem 3.2.1. 


The following result justifies the use of the multivariate PT approach, as 
it shows that the PT vector Y, suitably standardized, approximately follows 
the distribution of U close to one. 


Proposition 3.2.2 Suppose that U = (Uj,...,Ua) follows a copula 
C € D(G) with corresponding D-norm ||-|| generated by Z. If the rv 
V in the definition (3.16) of the PT vector Y has this generator Z as 
well, then we have 


P(U > v) = P(Y; > wy +0j(1—w)), 1S j <d|U > uw) + ollt- vll) 
= P(V > v) + o(l|1 — vll), 


uniformly for v € [u,1] C R°. 


The term o(||1 — v||) can be dropped in the preceding result if C is a GPC 
itself, precisely if C(v) = 1 — ||1 — vl||p, v € [u, 1] c R°. 
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Proof. From Lemma 3.1.13, we obtain the expansion 


P(U >v)= E ( min ((1 — v)23)) + o(||1 — vl) 


1<j<d 
uniformly for v € [0,1]. On the other hand, we have for v close enough to 1, 
P(Y; > uj +uj(l—uj), 1<jsd|U>u)=PV; >,1<5 <4) 


=E (mia (a = 05)25)) 


where the final equation follows from (2.16). This completes the proof. 


If the copula C is not known, the preceding PT approach can be modi- 
fied by replacing C with the empirical copula; see Aulbach et al. (20126) for 
details. 


3.3 Copulas Not in the Domain of Attraction 


It is by no means obvious to find a copula C that does not satisfy C € D(G) 
for some SMS df G. An example is given in Kortschak and Albrecher (2009). 

The following result provides a parametric family of bivariate rvs that are 
easy to simulate. Each member of this family, whose parameter is different 
from zero, has the property that its corresponding copula does not satisfy the 
extreme value condition (3.8). These bivariate copulas can easily be used to 
construct copulas in arbitrary dimension that are not in the domain of an 
SMS df; just add independent components. 


Lemma 3.3.1 Let V be an rv with df Hy(u) := u(1 + Asin(log(u))), 
0<u<1, A€ [-1/V2,1/V2]. Note that H\(0) = 0, Hy)(1) = 1, and 
H; (u) > 0 for0<u<1. Furthermore, let the ru U be independent of 
V and uniformly distributed on (0,1). Put Sı := U =: 1— S2. Then, 
the copula C) corresponding to the bivariate rv 


2 


Koes (= : ) e (—oo, 0]? (3-17) 


Si So 


is not in the domain of attraction of a multivariate EVD if A # 0, 
whereas Co € D(G) with corresponding D-norm 


_ [æl |z2] 


I 


for x = (xı, x2) £0. 
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Denote by Fy the df of —-V/ S1 =p —V/S . Elementary computations yield 


(agra fest, 


F. = 
ala) 1— |z| (3 + 4(2sin(log |x|) — cos(log |x|) ). if—1< x< 0, 


thus, F is continuous and strictly increasing on (—oo, 0]. 
Proof (of Lemma 3.3.1). We show that 


1— — =4 
lim Ca(l-—t,1-—t) 
t10 t 


does not exist for A € [—1/V2, 1/,/2]\ {0}. Since Cy coincides with the copula 
of 2X, we obtain 


1—Cy(Fy(s), Fa(s)) _ 1— P(-V/Si < s,-V/S2 < 8) 
1 — Fy(s) 1— P(-V/S, < 8) 
1— P(V > |s|max(U,1—U)) 
1—P(V > |s|U) 
i P(V < |s|max(u,1 — u)) du 
J, P(V <|s|u) du 
Jo’? Ha(|s| (1 u)) du + fi Ha(\s] u) du 
fo Hy (|s| u) du 
7 Jin H, (|s| u) du 
7 J, H (|s| u) du l 


The substitution u > u/ |s| yields 


| LLOG) Pl) Siy Hau) du 


2 1— F(s) J! H, (u) du 
_ eee Hy(u) du 
JE! Hy (u) du ` 


where we have for each0 < c< 1 


(i 2 c 
I H, (u) du = S + af usin(log(u)) du 
0 0 


and 


N 


C 


n u sin(log(u)) du = = (2 sin(log(c)) — cos(log(c))), 
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which can be seen by applying integration by parts twice. Hence, we obtain 


i H, (u) du 
ie Ay (u) du 
zl 4 + a(2 sin(log |s| — log(2)) — cos(log |s| — log(2))) 
4 + + 4(2sin(log|s|) — cos(log |s|)) ; 
whose limit does not exist for s fî 0 if A € [1/V2,1/V2] \ {0}; consider, e.g., 
the sequences sP = —exp((1 — 2n)7) and s?) = —exp((1/2 — 2n)r) as 
n—> o. 


On the other hand, elementary computations for x = (x1, £2) € (—oo, 0]?\ 
{0} show 


|x1| |x2| 


lim 1—Co(1 + tax) 
ain 


jm = 2E(max(|x1| S1, |x2| S2)) = ||all, — 


Corollary 3.1.6 now implies that Co € D(G), with the corresponding D-norm 
being the above limit. 


® 


Check for 
updates 


A 


An Introduction to Functional Extreme Value 
Theory 


The extension of D-norms to functional spaces in Section 1.10 provides a 
smooth approach to functional extreme value theory, in particular to general- 
ized Pareto processes and max-stable processes. Multivariate max-stable dfs 
were introduced in Section 2.3 by means of generalized Pareto distributions. 
We repeat this approach and introduce max-stable processes via generalized 
Pareto processes. In Section 4.3, we show how to generate max-stable pro- 
cesses via SMS rvs. This approach, which generalizes the max-linear model 
established by Wang and Stoev (2011), entails the prediction of max-stable 
processes in space, not in time. The Brown-—Resnick process is a prominent 
example. 


4.1 Generalized Pareto Processes 


In this section, we extend the simple multivariate generalized Pareto distri- 
bution as defined in equation (2.9) in an obvious way to functional space. 


DEFINING A SIMPLE GENERALIZED PARETO PROCESS 


Let Z = (Z:)tejo,1, be the generator of a functional D-norm ||:||p on E[0, 1] 
with the additional property 


Zi<c, té€ (0,1, (4.1) 


for some constant c > 1. For each functional D-norm, there exists a generator 
with this additional property; see Theorem 1.10.8. Let U be an rv that is 
uniformly distributed on (0,1) and that is independent of Z. Put 


1 1 
V := (Vijeto, = i (Z)te[0,1) = y (4.2) 
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Denote using [0, cll! := {f : [0,1] — [0, c]} the set of all functions from the 
interval [0, 1] to the interval [0, c]. Repeating the arguments in equation (2.11), 
we obtain, for g € E[0,1] with g(t) > c, t € [0,1], 


PW <o=P(v> Že, t€ [0.tl) 


z S a (v >—,,te 0.11) (P * Z) d ((2)te[o,1)) 


i his on” (v =n (5) (P * Z) d ((zi)telon1) 


Zt 
= 1—-P|UK< su — PxZ)d((z 
f., [0,1] ( oa Bor (5) ( ) (( t)t€[0,1)) 
at 


=1- f sup (=) PxZ)d((z 
[0,c}4 tejo,1] \ g(t) ( ) d ((zt)ee(o,1)) 


Lt 
-1-8 ( s» (2)) 
=1-|1/gllp; (4.3) 


i.e., the functional df of the process V is given by 1 — ||1/g||p if g is pointwise 
larger than c. We have, moreover, 


1 
PV St) aI z, x >c, te [0,1], 


i.e., each marginal df of the process V is equal to the standard Pareto distri- 
bution in its upper tail. Therefore, we call the process V a simple generalized 
Pareto process (GPP); see Ferreira and de Haan (2014) and Dombry and 
Ribatet (2015) for detailed discussions. 


SURVIVAL FUNCTION OF A SIMPLE GPP 


The following result extends the survival function of a multivariate GPD as 
in equation (2.12) to simple GPP. The dual D-norm function corresponding 
to a functional D-norm was introduced in (1.28). 


Proposition 4.1.1 Let Z = (Z;)tejo,1] be the generator of a functional 
D-norm ||:||p with the additional property ||Z||,, < c for some constant 
c>1. Then, for g € E[0,1] with g(t) > c, t € [0,1], we obtain 


PV > 4) = PV > 9) = E( inf (£) =V 1/gùp. 


te[0,1] \ g(t) 
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Proof. Repeating the arguments in equation (4.3), we obtain 


P(V > g) = = P (v < gt)’ te [0, i!) (P * Z) d ((2)te(0,1)) 


7 h glo j (v o (=) (P * Z) d ((2)se(0,u)) 
~ ha ajlo rie ( =) (P * Z)d((z)te(o,1)) 


a3) 


EXCURSION STABILITY OF A SIMPLE GPP 


The following result extends the excursion stability of a multivariate simple 
GPD in equation (2.13) to a simple GPP. 


Corollary 4.1.2 Under the conditions of Proposition 4.1.1 and the ad- 
ditional condition E (infrejo,1] Zi) > 0, we obtain 


1 
ENV O tar 


Proof. We have 


> t9, V > 
P(V > tg|V > g) = EV ZV 2 9) 


P(V >g) 
P(V > tg) 
~ PW 9) 
— RI/(tg) tp 1 
~ Q1/gtp t 


The conditional excursion probability P(V > tg|V > g) = 1/t, t > 1, 
does not depend on g. We therefore call the process V excursion stable. 


SOJOURN TIME OF A SIMPLE GPP 
The expected sojourn time of a simple GPP provides another example of its 


excursion stability. The time that the simple GPP V = (V;)rejo,1} spends 
above the function g € E[0,1], g > c > 1, called sojourn time above g, is 


164 4 An Introduction to Functional Extreme Value Theory 
1 
sr(a)= | 1(g(t),00) (Ve) dt. 
From Fubini’s theorem, we obtain 
1 
B(T) = E( f tioa (Vo at) 
1 
= f E (ioo V) d 
1 
= | PV >swyat 
0 
> il 


=| am! 


Recall that P(V; < x) =1—1/z,2>c,t € [0,1]. 
By choosing the constant function g(t) := s > c, we obtain for the expected 
sojourn time of the process V above the constant s 


BIST) =B( f tuod) = 4 


Given that the sojourn time S'T(s) is positive, this implies for the conditional 
expectation of the sojourn time the equation 


E(ST(s) | ST(s) > 0) = Ea 
B 1/s 
1- P(V; < s,t € [0,1]) 
1 
= (4.4) 


independent of s > c. Different than the multivariate case, where we denote 
the vector (1,...,1) € R? with constant entry 1 using boldface type 1, the 
constant function, with value 1 on [0,1], is denoted using regular type 1. 

Interestingly, the number ||1||p is introduced in equation (2.28) in the 
multivariate case as a measure of tail dependence between the margins of the 
underlying multivariate df. The tail dependence increases if ||1||,, decreases, 
with ||1||p = 1 being its minimum, attained for ||-||p = Illo- 

From Lemma 1.10.2, we know that in the functional case 


Ito 2 WA. = 1, 


and thus, E(ST(s) | ST(s) > 0) increases with decreasing ||1|| p; its maximum 
value is one in the case ||1||,, = 1, which characterizes the functional sup-norm 
\|-\|,, by the functional version of Takahashi’s Theorem in Corollary 1.10.5. 
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4.2 Max-Stable Processes 


Let VO,V@),... be a sequence of independent copies of V = Z/U, where 
the generator Z satisfies the additional boundedness condition (4.1). We ob- 
tain, for g € E[0,1], g > 0, and large n € N, 


n 1<i<n 


1 ; . 
pà max VO <q) =P(VO <ng1<i<n) 


i 


1 
noo EXP (- |; 
g 


a) 


where the mathematical operations maxı<i<n AA etc., are taken compo- 
nentwise. The above reasoning is strict if infyeio,1) g(t) > 0. Otherwise, check 
that the above convergence is still true with the limit exp(— ||1/g||,) = 0. 


Next, we ask: Is there a stochastic process € = (&)1¢19,1 0” [0,1] with 


Pe <a)=e(-|- 


i g € E[0,1], g > 0? 
D 


If € actually exists: Does it have continuous sample paths? 

If such £ exists, it is a maz-stable process: let £), €@),... be a sequence of 
independent copies of the process €. Then we obtain, for arbitrary g € E[0, 1], 
g > 0, and any n € N, 


P G max e < o) =P E E0 < ng) 


n 1<i<n 1<i<n 


=P(€ <ng,1<i<n) 


= [IP (6° < no) 


41 
= P (£ < ng) 
1 n 
-e (- fal) 
Cl) 
= exp | — |/- 
I\lp 


= P(é < g). 


Such processes € actually exist, see equation (4.7). 
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STANDARD MAX-STABLE PROCESSES 


We denote using E~ [0,1] := {f € E[0,1]: f <0} the set of those functions 
in E[0, 1], that attain only non-positive values. 


Definition 4.2.1 Let n = (m)rejo,1] be a stochastic process in C[0, 1], 
with the additional property that each component m follows the stan- 
dard negative exponential distribution exp(x), x < 0. Let n®, n®,... 
be independent copies of 7. We call the process 7 an SMS process if, 


for arbitrary f € E` [0,1] and any n € N, 


(i) = 
È (r n r) = P(n < f). 


A proper choice of f, as in (1.29), shows that each finite dimensional 
margin (71,,---,%ta), 0 < tı <- < ta < d, d E N, of an SMS process 9 
follows an SMS df as in Definition 2.3.2. 

The following result, which goes back to Giné et al. (1990), can now be 
formulated in terms of the functional D-norm. We do not provide a proof 
here, but refer the reader to Giné et al. (1990) instead. 


Theorem 4.2.2 A process ņ in C[0,1] is an SMS process iff there exists 
a D-norm ||:||p on E[0,1], such that 


P(n < f)=exp(-llfllp), f € Æ- [0,1]. 


The preceding result immediately entails, for example, that the rv X := 
SUPzejo,1] Mt follows a negative exponential distribution: 


P(X <2) = P (m < z, t € (0,1) 
= P(n< x1) 
= exp (— ||21||p) 
=exp(zlllp), = «<0, 
i.e., the rv X = sup;¢jo,1] 7 is Negative exponentially distributed 


P(X < x) = exp(z/ð), z <0, 


with parameter V = 1/ ||L||p. Recall that ņ has continuous sample paths; as 
a consequence, we obtain in particular 


P(m = 0 for some t € (0, 1]) =P ( sup 7% = o) 
te[0,1] 


=1-— P(X <0) 
=1-—P(X <0)=0. (4.5) 
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This observation can be extended considerably as follows. Choose f € 
E- [0,1]. Then 


P(m = f(t) for some t € [0,1], n < f) =0. (4.6) 


With f = 0 we obtain equation (4.5) again. Equation (4.6) is an immediate 
consequence of the next result, by observing that 


P(m = f(t) for some t € [0,1], n < f) = P n < f}\{n < f}) 
= P(n < f)- P(n < f)=0. 


Lemma 4.2.3 For an arbitrary SMS process n with corresponding D- 
norm ||-|| 5, we have 


P(n < f) = P(n < f) = exp (- ||fllo);> £ € £~ [0,1]. 


Proof. Choose f € E~ [0,1]. Corollary 1.10.3 implies that 


1 
A noo 
b “|. selz 


thus, we obtain from the continuity of probability theorem 


ra<n>r(U {n<s-2}) 
nen 
= lim P < 1 
e 
1 
= time (- -al 
= exp (- IIfllp)- 


Because 
P(n < f) < P(n < f) =exp(—|Ifllp), 


the assertion follows. 


SIMPLE MAX-STABLE PROCESS 


As P(n < 0) = 1 by (4.5), we can put 
é:=—-. (4.7) 


The process € = (£;)+e[o,1) has continuous sample paths, each margin € is 
standard Fréchet distributed 
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Pl <y) =P(ms-=) =e(-4), y > 0, 
y y 


iL.) 
Clas 


PE< o) =P (n<>) = exp -E 


and, for g € E[0, 1], g > 0, we have 


The process € is, consequently, max-stable as well. It is called simple maz- 
stable in the literature. 


GENERATION OF SMS PROCESSES 


In Proposition 2.4.1 we showed how to generate an SMS rv on Rê via inde- 
pendent copies of a generator of the corresponding D-norm. This approach 
can be repeated to generate an SMS process. 


Proposition 4.2.4 Let Z® , i €N, be independent copies of a bounded 
generator process Z = (Zt)tejo,1] of a functional D-norm ||-||p, and let 
Eı, E2,... be tid standard exponential rvs, also independent of the se- 
quence Z®, Z),... Put Vj :=1/ Xp; Ex, i € N. Then, the stochastic 
process 


1 1 


7 rc03 = EO) (Sapien (VE) 


te [0,1] 


is an SMS process with 


P(n < f)=exp(-Ilfllp), £ € Æ- [0,1]. 


The condition that Z is bounded can be dropped in the preceding result, 
see the proof of equation (9.4.6) in de Haan and Ferreira (2006). 


Proof. If we know already that the process ņ has continuous sample paths, 
then we obtain, for f € E~ [0,1], using the continuity of probability theorem 
again, 


P(n<f)=P (A {nu < sa) = lim P (À {nu < se) l 
iEN i=1 


where {t1,t2,...} is a dense subset of [0,1] that also contains the finitely 
many points t € [0,1] at which the function f is discontinuous. From Propo- 
sition 2.4.1, we obtain that 


d 
1 


Nty,..ta = (Mas -3 Ma) = — — So 
SUDjEeN (vizi ) 


4.2 Max-Stable Processes 169 


is an SMS rv in R? for arbitrary indices 0 < tı < --- < ta < 1 and d E€ N 
with df 


P (na, ,...ta S £) = exp (- lællo,, $ 


fated 


The dominated convergence theorem, together with the fact that Z has 
continuous sample paths, implies 


P (À {m, < sen) = P (na, ta S (Fa) 


i=l 


= exp (= MEE) FCD 4, ) 


pasay 


= exp (-z (px max AF CE; ;)| Zn))) 
=n CXP (-« (se (FÆI z3) 


=exp(- ||fllp)- 


Therefore, we have established 


P(n < f)=exp(-lIfllp), f € EF [0,1]. 


It remains to show that ņ has continuous sample paths. Put E(t) := 
SUPjcn (viz), t € [0,1]. We show 


lim inf E(t) > &(to), 


and 
lim sup €(t) < E(to) 


t-to 


for each to € [0,1] with probability one. This implies pathwise continuity of 


the process € = (&(¢))se[0,1]- 
Recall that we require boundedness of Z, i.e., sup;ejo,1) Zt < c for some 
number c > 1. For any M € N, we have 


(2) 
=m m z® TAN 
E(t) = max (z ax (vizi ) , sup =) 


i>M41 hy 
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IA 


maxi<i<M (viz?) + STE, 


; k=1 
2 Maxi<i<M (vz) . 


vV 


The continuity of each Zo implies 
Pe 9 
> . 
Hmjn 6) > max, (Vizi 


for each M € N; thus, 


lim inf €(¢) > sup (Viz?) : 


t—+to icN 
On the other hand, 
limsup¿(t) < max (w2) one 
tto T 1<i<M a MEL T 


thus, by the law of large numbers, 


lim sup E(t) < sup (viz?) 
t-to iEN 

for each to € [0,1], with probability one. This shows that the process € has 

continuous sample paths and, therefore, the process n = —1/€ as well. Note 

that P( > 0) = 1, which can easily be seen using the fact that € is a max- 

stable process, as in the proof of equation (9.4.6) in de Haan and Ferreira 

(2006). 


SURVIVAL PROBABILITY OF SMS PROCESS 


We can easily extend the bounds in Lemma 2.4.2 for the survival probability 
of an SMS rv in R? to an SMS process. 


Lemma 4.2.5 Letn be an SMS process with a corresponding functional 
D-norm ||:||p. For f € E~ [0,1], we have 


(i) Pn > f)21—exp(— Uf Xp), 


(i) limsyo EZP = x f ùp. 


S 


Proof. Choose f € E~ [0,1], and let {t1,t2,...} be a dense set in [0, 1], which 
also contains the finitely many points t € [0,1] at which f is discontinuous. 

We can assume wlog that sup;ejo,1] f(t) =: K < 0; otherwise, the prob- 
ability P(ņ > f) would be zero and parts (i) and (ii) of Lemma 4.2.5 are 
obviously true as X} f Rp = 0. From Lemma 2.4.2 and the continuity of 7, we 
obtain, for € € (0, |K]), 
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P(n > f)= (Atm > f(t +e) 


iEN 


= i P (f) tn > f(t D+) 


> liminf (1-exp (-2(.m min (|f(ti) + el Ze, )))) 


=1-exp (—2 ( int (uit) +120). 


Letting £ converge to zero, we obtain from the dominated convergence theorem 


Pm > f)21-exp (-E (int, (7120), 


which is part (i) of Lemma 4.2.5. 
Next, we establish the inequality 


Himsup ZS. < p ( min (FEZ). neN. 


s}0 sS 1<i<n 


The inclusion—exclusion principle implies 


P(n > sf) <P (À {m > “tt 


Ste (U {m < sit) 

=1- Sl (17 *P(m, < flt; j ET) 
OAT C{l,...,n} 

et ent exp | —sE | max (LE) Zi) 
Mo ( (a5 )) 


II 


:1— H(s) = H(0) — H(s) 


by equation (1.10). 
The function H is differentiable; thus, 


fim sup 2002 2f)  — tion Hl9) = HO) 
sO S sO S 
= -H'(0) 


= Ð (DME (mex (70691 ,)) 


ZTC{1,...n} 
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=E 5 (DITI max (IEE) Ze) 


OAT C{l,...,n} 


according to Lemma 1.6.1. Letting n tend to infinity, the dominated conver- 
gence theorem implies 


P(n > i 
hip N <E ( inf (|f(¢)| z1) =} ftp. 
s}0 8 te [0,1] 


The Taylor expansion exp(x) = 1 + x + o(x) for x > 0, together with the 
lower bound in part (i), implies 


P I= = 
iot 8) peice E 
sO S s40 s 
sin AP) 
sO S 
=ù f Xb, 


which completes the proof of part (ii) and, thus, of Lemma 4.2.5. 


It is easy to find an SMS process 7 and f € E7 (0, 1] with a strict inequality 
in part (i) of Lemma 4.2.5; see the next example. This construction of an 
SMS process is a particular example of a maz-linear model discussed and 
generalized in Section 4.3. 


Example 4.2.6 (Simple Max-Linear Model) Take two indepen- 
dent and identically standard negative exponentially distributed rvs no, 
m, and put, for t € (0,1), 


Mi := max M o ; 
IE 


Then, the process 7 := (7)+e[o,1) is continuous and satisfies for f € 
E- [0,1] 
P(n < f) = exp (- || flip); (4.8) 


where the functional D-norm ||-||p is generated by the process 


Z = (Zi)tejo,1] := (max ((1 — t) Zo, tZ1))rejo,1]> 


with Zo € {0,2}, P(Zo 0) P(Zo 2) 1/2 and Zı := 2 — Zo. 

We have min(Zo, Z1) = 0 and, consequently, X f tp = 0, f € E[0, 1]. 
The lower bound in part (i) of Lemma 4.2.5 is, therefore, zero, which is 
less helpful: 
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P(n > f) 2 l>exp Uf tp) =1—exp(0) = 0. 


The function f is negative and continuous, and we obtain 


P(n>f)=P (max (E z) > f(t), t € [0, n) 


1-t 
= P(m >—1,m > —1) 
= P(m > -1)? 
exp(—2) > 0. 


> e T > f(t), te [0,1/2]; > f(t), tE (1/2, 11) 


Although it is a bit uncommon, we add a proof of the preceding example. 


Proof (of Example 4.2.6). The process Z is non-negative, pointwise not larger 
than 2, and satisfies for each t € [0, 1] 


PG) = 2(1 = HP(Zo = 2) + 2P(Zo = 0) = 1. 


It is, therefore, the generator of a functional D-norm ||-||p. We have, for 
f € E~ (0,1, 
IIfllp =£ ( sup (|f()| zo) 

te[0,1] 


tefo, te[0,1] 


=E ( sup (FOIZ) (Zo = ») +E ( sup (|f(t)| Ze) 1(Zo = ») 


=2 ( sup ((1 = t) son) P(Z =2)+2 ( sup eron) P(Zo = 0) 


t€[0,1 te€ [0,1] 
= sup ((1—t)|f()|) + sup IFAN 

te [0,1] te [0,1] 
= (apeos gp eo) 


We have, moreover, 
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=P (m < inf (1 - Dro) P (m < inf so) 


€[0,1] te [0,1] 
— (, int (1 — nfo) a ( int (©) 
= exp ( int (0-940) + int (FO) 
seati 


which proves equation (4.8). 


THE RANGE OF THE COMPONENTS OF AN SMS PROCESS 


By repeating the arguments in the proof of Lemma 2.5.5 word for word, we 
can extend it to a functional version. 


Lemma 4.2.7 Letn = (7)iejo,1) be an SMS process with corresponding 
D-norm generated by the process Z = (Zi)tejo, ForO0<a<b< 1l, 
we have the bound 


E| sup |m- ns] = E ( max m) = E (min m) 
s,tE€ [a,b] tE[a,b] tE[a,b] 


1 1 


ZE (minteja,t] Zi) T E (maxte[a,b] Zi) ` 


With a = 0 and b = 1, this bound becomes 


il 1 
E| sup |m- n|] <——-——. 
(22, RIND ILo 


Example 4.2.6 shows that 2 1 Xp can be zero, in which case the preceding 
upper bound is not helpful. 

Clearly, the process 7 has continuous sample paths in our setup. But it is 
worth mentioning, on the other hand, that the upper bound in Lemma 4.2.7 
implies continuity in probability of n, i.e., P (Ine — ns| > €) —>t—s 0, for each 
s € [0,1]: the pathwise continuity of Z, together with the dominated conver- 
gence theorem, yields 


lm E ( min 21) = E(Z;) =1= lim E (sna z) : 
b—al0 tE[a,b] b—al0 tE [a,b] 
if a < s < b. Markov’s inequality, together with Lemma 4.2.7, then implies 
1 
P| sup |m—ns| 2e] SE | sup |m — ns| | —>v-a0 0, 
tE[a,b] E t€[a,b] 


ifa < s <b. 
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4.3 Generalized Max-Linear Models 


We propose a way how to generate an SMS process in C[0,1] from an SMS 
rv in R by generalizing the max-linear model established by Wang and 
Stoev (2011). For this purpose, an interpolation technique that preserves max- 
stability is proposed. It turns out that if the rv follows some finite dimensional 
distribution of some initial SMS process, the approximating processes con- 
verge uniformly to the original process and the pointwise mean squared error 
can be represented in a closed form. This method enables the reconstruction 
of the initial process only from a finite set of observation points, and thus, 
reasonable prediction of max-stable processes in space, not in time, becomes 
possible. The Brown—Resnick process is a prominent example. 


THE GENERALIZED MAX-LINEAR MODEL 
Let X = (Xo,...,Xa) be an SMS rv with pertaining D-norm ||-|| p, on 
Rit! generated by Z = (Zo,..., Za), d € N, i.e., 


P(X < £) = exp (- lælos,. a) = exp (-z (max, id 2)) 


siete 


x = (z0,..., £a) < 0. Choose arbitrary deterministic functions go,...,ga € 
CT [0,1] := {g € C[0, 1] : g > 0} with the property 
II(go(t),---,9al)Ilpo a51 tE [0,1]. (4.9) 


For instance, in the case of independent margins of X, we have ||-|| p, 


||-||,, and condition (4.9) becomes 


d 
Sa) =1, tE (0, 1], 
i=0 
i.e., gi(t), i =0,...,d, defines a probability distribution on the set {0,...,d} 
for each t € [0,1]. This is the setup in the maz-linear model introduced by 
Wang and Stoev (2011). An example is given by the binomial distribution 


gi(t) := (‘) Migs, i=0,...,d, t € [0,1]. 


a 


ho, ..., ha € C+[0, 1], which satisfy X52, h(t) > 0 for t € [0,1]. Then, 


(ho(t), ..-, ha(t)) 
|(ho(t), ---, hat) Il, 


(go(t), sig , ga(t)) = ’ tE [0, 1], 
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satisfies condition (4.9). Particularly helpful functions gj,...,g% are defined 
n (4.12). 
Now, for t € [0,1], put 


m= (4.10) 


The model (4.10) is called the generalized maxz-linear model. It defines an SMS 
process, as the next lemma shows. 


Lemma 4.3.1 The stochastic process n = (nt )eeto, 1) in (4.10) defines 
an SMS process with generator process Z= (Zia given by 


(9%), telo 


In model (4.10) we have not made any further assumptions on the D-norm 
Illo, a that is, on the dependence structure of the rv Xo,...,Xa. The 


= ||-||, characterizes the independence of Xo,..., Xa. 


pene 


gerne 


dence Xp = --:: = Xa a.s., with the po Pa eater Zo = = Za =l. 


giaa 


Z= max (gi(t)Z;) = Pee gi(t) = 1, te [0,1]. 
Proof (of Lemma 4.3.1). At first, we verify that the process Z is indeed a 
generator process. It is obvious that the sample paths of Z are in C* [0,1], 
owing to the continuity of each gi. Furthermore, for each t € [0,1], we have 
by construction 


E (2) = II(go(t),---,9a(t))Ilp, 4g = 1. 


As ||-||,, < |||) for an arbitrary D-norm, we have ||(go(t),.-., ga(t)) ll, < 1, 
t € [0,1], and, thus, Z < maxj=o,...a Zi, t € [0,1]. 
In addition, we have for f € E~ (0, 1] 


P(n < f) 
= P(X; < git) f(t), i =0,...,d, t € [0,1]) 


=P(x< inf (gt) f(t), i=0,....4) 


te [0,1] 


= (x, <= aap GA FON. i=0,...,4) 


te [0,1] 


( sup (go(t) |f(#)|),-.-, sup (ga(t) so) 


te [0,1] "te [0,1] 


= exp - 
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= o0(-#( er ( ap, (00100 )4))) 


~ 
Il 
(= 


S (-e( 2x, (rol 2))) 


which completes the proof. 


IF CONDITION (4.9) Is DROPPED 


Condition (4.9) ensures that the univariate margins m, t € [0, 1], of the process 
7 in model (4.10) follow the standard negative exponential distribution P(n < 
x) = exp(x), x < 0. If we drop this condition, we still obtain a max-stable 
process: for n € N, take iid copies n™,...,7'™ of 7, defined in (4.10). For 
f € ET [0,1], we have 


P ¢ max 7‘) < r) 


1<k<n 
=P(x< inf (2070). i=0,....d) 
te [0,1] n 
= exp | — | ( sup (DIOD sup Cael) FC) 
telo] l Do,...a 
= P(n < f). 
The univariate margins of 7 are now given by 
P(m < 2) = exp (Igo); -+ a)l, a 2) > (4.11) 


for x < 0 and t € (0, 1]. 


RECONSTRUCTION OF SMS PROCESS 


The preceding approach offers a way to reconstruct an SMS process in such 

a way that the reconstruction is again an SMS process. Let 7 = (n)rejo,1] 

be an SMS process with generator process Z = (Zt)tejo,1] and D-norm ||-|| p- 

Choose a grid 0 =: sọ < sı < +- < Sq-1 < Sa := 1 of points within [0,1]. 

Then, X := (Nso,---,Ns4) is an SMS rv in R+! with pertaining D-norm 
P , generated by (Zig gen vg Zay): 


In what follows, we define an SMS process 7 = (7:)+e{0,1) for which Às, = 
Ns;, i =0,...,d, holds, i.e., interpolates the finite dimensional projections 
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(Nso>+-+3Nsq) Of the original SMS process 7 in an appropriate way. This is 
done by means of a special case of the generalized max-linear model, i.e., by a 
particular choice of the functions g; in equation (4.10). We show that this way 
of predicting 7 in space is reasonable, as the pointwise mean squared error 


2 
MSE (ni?) = E (n — at) ) vanishes for all t € [0,1] as d increases. 


Moreover, we establish uniform convergence of the “predictive” processes and 


the corresponding generator processes to the original ones. 


PROPER CHOICE OF AUXILIARY FUNCTIONS 


As shown in Lemma 4.3.1, the stochastic process 7 = (Ĥt)tejo,ı]; defined 
through its margins by 


7 Ns; 
= max —+, te[0,1, 
ee = 


is an SMS process with generator process Z = (Z:)tefo,1, given by 


Z = max (gilt) Zs; ) , te (0, 1], 
i=0,...,d 
for arbitrary functions go,.-.,gq in Ct[0,1] that satisfy condition (4.9). We 
are going to specify these auxiliary functions now. 
Denote by || ‘ID... the D-norm pertaining to the bivariate rv (7s,_,, 1s; ); 
i=1,...,d. Put 


sy —t 


oC SO) 
g(t) = ¢ lsi illoa 
0, else, 
t— Si—1 
; tE |Si-1, Sil], 
CS ae Pena 
gj (t) = — mt t€ [sisi], =b d71, 
[siz = t t= si)ll Dia 
0, else, 
t — Sq 
jen ee te [Sa—1, Sa], 
gi(t) = & lsa = t, t= sazi) Daia (4.12) 
0, else. 


Clearly, gj,..-,9% € CĦ[0, 1]: the fact that a D-norm is standardized implies 


: Si — Si—1 Si+1 — Si ; x 
fn = Tossa, Men 8 0lp,,, ee 


Moreover, we have, for t € [s;-1, si], i = 1,...,d, 
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MO Oloa = OOo, = 1 


pete 


Hence, the functions gj,..., 97 are suitable for the generalized max-linear 
model (4.10). In addition, they have the following property: 


Lemma 4.3.2 The functions g5,..., 97 defined above satisfy 


II9i loo = 9% (Si) 


In view of their properties described above, the functions gf work as ker- 
nels in non-parametric kernel density estimation. Each function g} (t) has max- 
imum value 1 at t = s;, and, with the distance between t and s; increasing, 
the value g*(t) shrinks to zero. 


Proof (of Lemma 4.3.2). From the fact that a D-norm is monotone and stan- 
dardized, we obtain, for i = 1,...,d—1 and t € [s;-1, $i), 
t— Si—1 1 1 
pen a e e S aa S 
I|(si = t,t- 8i-1)|lp I sit 1) | (9, Dllo 
Diii 


i—l,i 
and for t € [si, Si+1) 


Si+1 —t 1 1 
g; (t) = 


ae E E 
Il (Sia —t,t- SID, aya | (1 S ) | i 
Di iya 


> si4ı—t (L O)| oai 


Analogously, we have gj < 1 and gi < 1. The assertion now follows since 
gž(si) = 1, i= 0,...,d. 


THE SMS PROCESS WITH THESE AUXILIARY FUNCTIONS 


The SMS process Ĥ = (Ĥt)tejo,ı] that is generated by the generalized max- 


linear model with these particular functions gğ,...,g} is given by 
fy = max ( Msi- Ms: ) 
gi (t)’ g7 (t) 
Nsi—ı Ns; 
= ||(si— t,t- si- a =), 4.13 
[i= tt- sedlo max (Be, e), aas) 


for t € [si-1,s:], i =1,...,d. Note that ns; < 0 a.s., i = 0,...,d. This implies 
that the maximum, taken over d+ 1 points in (4.10), goes down to a maximum 
taken over only two points in (4.13), since all except two of the g; vanish in 
t € [s;-1, si], i =1,...,d. We have, moreover, 


fsi = Nsi» i=0,...,d, 


thus, the above process interpolates the rv (Nso, - - - , s4). In summary, we have 
established the following result. 
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Corollary 4.3.3 Let n = (n)tejo,1] be an SMS process with generator 
Z = (Z:)tejo,1j, and let 0 = sp < si <,...,< sg_1 < sg = 1 bea 
grid of points in the interval [0,1]. The process 4 = (7)te[0,1] defined in 
(4.13) is an SMS process with generator process Z= (Zt)te[0,1); where 


max ((si —t)Z,, ,,(§- si-1)Zs,) 


Peet) ee 
(ear F Pac 


i—l,i 


The processes ù) and Z interpolate the rv Mao o ea) and (Zao 
respectively. 


We call 4 the discretized version of n and Z the discretized version of Z, 
both with grid {s9,..., sa}. 


MAX AND MIN OF DISCRETIZED VERSIONS 


We show that the discretized version of the underlying SMS process converges 
to this very process in a strong sense. We need the following two lemmas, which 
provide technical insight into the structure of the chosen max-linear model. 


Lemma 4.3.4 The SMS process defined in (4.13) fulfills, for i = 
1 


pees 


sup mM = Max 0S , Ns; ) , 
téE[si—1,3:] 


ace i ~ (sae) Ib, 


4-154 


This minimum is attained for t = (8i-17s;_, + $iNs;)/(Ns;_1 +s; )- 


Proof. We know from Lemma 4.3.2 that g*_,(t),g*(t) < 1 for an arbitrary 
i= 1,...,d and t € [s;-1, si]. Hence, 


Nsi—ı Ns; 
gt) oF (t) 


for i = 1,...,d and t € [s;_1, 5;]. The fact that g*_,(s;-1) = 1 = g}(s;) 
yields the first part of the assertion. Recall that ns, < 0 with probability one, 
i=0...,d. Moreover, for t € (s;-1, si), we have 


MH = max ( ) < max(7s,_, ? Ns:) 


Msi < nsi Si—t < Nsi—ı t> Si—1si—1 F Sinsi , 
S;—t t— Si—1 t= Sii Ns; Nsi—ı + 1s; 


where equality in one of these expressions occurs iff it does in the other two. 
In this case of equality, we have 
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Ns; 


te = ||(si — t,t = si-1)llD, 1 aa [|s 1; Msi) ecu . 
E 


On the other hand, the monotonicity of a D-norm implies, for every t € 
(si-1, Si), with t > (Si—1si—1 + Sinsi )/(Nsi—ı Te Ns), 


e > ||(si — t,t — si—1)| 

= = 1) 
t= Sj-1 

> (= ; 1) 
Ns; 


The case t < (si—1Ns;_ı + Sins;)/(Ns:—ı + Ns;) works analogously. o 


Di-1,4 t— 


Ns; 
Di-1,i 


Ns, = || (sia + Msi) 
Di-1,4 


loa: ` 


As an immediate consequence of the preceding result, we obtain, for x < 0, 
NITE => MaX (Nso, Nsa) <2; 


L= 


>r 4 max ||(ns 11s) 


1<i<d Ber 


The next lemma is on the structure of the underlying generator processes. 
It is shown by repeating the arguments in the proof of Lemma 4.3.2. 
Lemma 4.3.5 The generator process defined in (4.14) fulfills, for i = 
PEA, 
sup Z, = max (Zaris 2a) ; 
t€[si~1,9%] 


Moreover, fori =1,...,d, 


=i 
Pomme Zeer) opm Ve eae Zane Zee 
t€[si_1,8:] 0 else. 


inf 


The minimum is attained for t = (8;-1Z5, + $i:Zs,_,)/ (Zire + Za) in 
the first case. 


In (2.28), we introduced the extremal coefficient ||1||, as a measure of 
tail dependence. The preceding result shows in particular that the extremal 


coefficient Æ (supreto,y Zn) of the SMS process Ĥ coincides with the extremal 
coefficient E(max;=o,....¢ Zs, ), which corresponds to the SMS rv (Nso; ---, sa): 


UNIFORM CONVERGENCE OF DISCRETIZED VERSIONS 


So far, we have only considered a fixed discretized version of an SMS process. 
The next step is to examine a sequence of discretized versions with certain 
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grids, whose diameter converges to zero. It turns out that such a sequence 
converges to the initial SMS process in the function space C[0, 1] equipped 
with the sup-norm. Thus, our method is suitable for reconstructing the initial 
process. 

Let 


D SD garg < O 


d 
a = {s® (Sy page, 186° <S xo <8 = 1, deN, 


be a sequence of grids in [0,1] with diameter 


Kq i= max Cs — s(,) >d 0. 


Let WY = (AP heto be the discretized version of an SMS process ņ = 

(m)tejo,1] With grid Ga. Denote using ŻA = (2°) Ba and Z = (4%:)te(0,1] 
t€[0,1 

the generator processes pertaining to 7 and 7 respectively. Uniform con- 


vergence of 4 to 7 and of Z™ to Z, as d tends to infinity, is established in 
the next result. 


Theorem 4.3.6 The processes and Ê, d € N, converge 
uniformly to n and Z pathwise, i.e., 4 -nll >d>œ 0 and 


|Z = z| d+ 0 with probability one. 


Proof. Denote by [t]a, d € N, the left neighbor of t € [0,1] among Ga, and 
by (t)a, d € N, the right neighbor of t € [0,1] among Ga. Choose a sequence 
of numbers s® e [0,1], d € N, with s gen s € [0, 1]. Then, obviously 
[sa asco s and (s\) 1 >a 8. Hence, we obtain by Lemma 4.3.4, and 
the continuity of the process n 


pla) < max A) — max > 
Iga = s€[[s]a,(s) 4] 7 (isda Ms) q) Sao Ns» 
as well as 
a(d : 
ie 2 mia Ba lln sdD]a N Msd)a) l doo Ns, 


2€[fo]u (a) a] [d] (dya 


where ||: denotes the D-norm pertaining to (Nisa), Misy). 
D [s(P]ar Ks) a 


KORRE 
Hence, the fee part cf the assertion is proven. 

Now, we show that Z +4... Z in (C[0, 1], lllo). If Zs # 0, the con- 
tinuity of Z implies Zisa] #0 4 Za), for sufficiently large values of d. 
Repeating the above arguments, the assertion now follows from Lemma 4.3.5. 
If Z, = 0, the continuity of Z implies 


2a < 2 max (Zisa Zisa) >d>æ 2Zs = 0, 
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which completes the proof. Check that 


[Ka -tt - | > 1/2, 
Disda, d) 


since every D-norm is monotone and standardized. 


INTERPOLATING A BROWN—RESNICK PROCESS 


A nice example is the SMS Brown-Resnick process 7, which is defined via the 
standard geometric Brownian motion 


t 
Zt := exp (2, — 5) ; t € (0, 1], 


as in (1.26), i.e., we have 


Pans f)=E ( sup (soi) =exp(-lfllp), f€ E (0,1). 


te [0,1] 


The complete D-norm || f ||) is unknown, but in Lemma 1.10.6, we have com- 
puted the bivariate D-norm 


_ pig (Y ,, ow (lel uh) 
lelo., = le (F + SE) 
+ plo (EF + eled) 


for x,y € R and 0 < s < t < 1, which is a bivariate Hüsler-Reiss D-norm. 
Writing s; instead of 3, etc., to ease notation, the interpolating SMS 
process defined in (4.13), now becomes 


fy = ((s _ 10 (48 q les ((si — t/t- “D 


2 VSi — ae 
i— Si, log(t— si—1/(si— t 
P t- siao (SE, et iale) 
2 Si — Si—1 
x max (7 ts), t € [sj;-1,8;], 1<i<d. 
s;—t t— Sj_1 


WHAT IF THE UNDERLYING D-NORM IS UNKNOWN? 


The preceding Theorem 4.3.6 is the main reason why we consider the dis- 
cretized version Ĥ of an SMS process ņ a reasonable predictor of this pro- 
cess, where the prediction is done in space, not in time. The predictions 
fj, of the points m, t € [0,1], only depend on the multivariate observations 
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(Nso: -- Nsa). More precisely, the only additional thing we need to know to 
make these predictions is the set of the adjacent bivariate marginal distribu- 
tions of (s9,---,sq), that is, the bivariate D-norms I'll, a> t= sayd: 
However, this may be a restrictive condition in practice and suggests the prob- 
lem of how to fit models of bivariate D-norms to data, which is beyond the 
scope of the present book. The Brown—Resnick process, including additional 
parameters, may serve as a parametric model to start with. 

The following results, however, are obvious. Let 7 be a point of the dis- 
cretized version defined in (4.13) and define a defective discretized version 
via 


= "si —1 Ns; 
— i — t,t — si—1)|| p, max ; —— ], 
us salle, (= =) 


for t € [siz1;,s:], i = 1,...,d, where ||:||5, is an arbitrary D-norm on R?, 
which we call the defective norm. Then, for every t € [si—1, si], i = 1,...,d, 
lin — fel = [Ii = tt = sidlo, = l(i = t, t= si-l, 
< min "si. , Ns; . 
Si — t t— Si—1 
In particular, we have ñs; = s; = s;,1 = 0, ...,d. This means that we obtain 


an interpolating process even if we replace the D-norm ||-||p,_, , with the de- 
fective norm ||-||,,. Furthermore, the defective discretized version still defines 
a max-stable process with sample paths in C7 [0,1] = {f € C[0,1]: f < O}. 
Check that its univariate marginal distributions are given by 


(si = tt slow, 
— te). 20, 


Pit < £) = 
(i < x) os II(si — t,t- sill, 


for t € [s;-1,5;], i = 1,...,d. These are still negative exponential distribu- 
tions, but not standard ones, as they are with the discretized version given 
in (4.13). In addition to this, the assertions in Lemma 4.3.4 also hold for the 
defective discretized version, since each defective norm |j-|| 5, is monotone and 
standardized. Repeating the arguments in the proof of Theorem 4.3.6 now 
shows that the uniform convergence toward the original process 77 is retained 
if we replace the norms ||-||p,_, , with arbitrary monotone and standardized 
norms ||-||5,. Note that Lemma 1.5.2 implies that these two properties al- 
ready imply that the bivariate norm ||-||5, is a D-norm. In that case, the 
only property of the discretized version that we lose is the standardization 
of the univariate margins, i.e., the resulting process is no longer a standard 
max-stable process. 


MEAN SQUARED ERRORS OF DISCRETIZED VERSIONS 


To calculate the mean squared error of the predictor 7, we have to determine 
the mixed moment E'(™7,). We could apply Lemma 2.5.1 if we knew that the 
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vector (m, Ñ+) was standard max-stable itself. This is verified in the following 
result. 


Lemma 4.3.7 Let n = (m)iejo,1) be an SMS process and denote by 
Ñ = (MH )eelo,1) its discretized version with grid {so,..., 8a}. Then, the 
bivariate rv (mM, i) is an SMS rv for every t € [0,1] with corresponding 
D-norm of the two-dimensional marginal 


Illy, =||(eataOv,9Oy) |). te bas, @=1,..4, 
t,i—1,i 


where Hla is the D-norm pertaining to (Nt, Nsi—1;Ns;)- 


1,4 


Proof. For every t € [si-1, si], x,y < 0 andi =1,...,d, we have 
Pim < z, ñ < y) 
= P (m < £, ns, < gf 1®y, ne < g; (t)y) 
= exp ( — E( max ( |e] Ze, 97-1 (8) lul Ze- 97 © lul Zs.) )) 
= exp (= E( max ( [a] Ze, Iyl max (97_1(0Zs,.,97(0Zs.)) )): 
The vector 
(Zi, max (97a (t)Zs 197 (0)Zs,)) 


defines a generator for every t € [si—1, si], i = 1,...,d, as for all such t 


=1. 


E ( max (9_1()2s.-49%()2s,)) = ME EO Il... 


Let us recall the sequence of processes that we discussed in Theorem 4.3.6. 
Suppose that ņ is an SMS process and choose a sequence of grids Ga of the 
interval [0,1] with diameter k4 a+. 0. Denote using HO, d € N, the 
sequence of discretized versions of 7 with grid Ga. Denote further using ||-|| p 
the D-norm as in Lemma 4.3.7, pertaining to (m, HO), te [0,1], d EN. 

Theorem 4.3.8 Letn and O, d € N, be as above. The mean squared 
error of Ah is given by 


MSE (4) =E (n z Ay) 


1 


o awla 
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Proof. The second moment of a standard negative exponentially distributed 
rv is two, and therefore, we obtain from Lemma 2.5.1 and Lemma 4.3.7 


MSE (ni?) = E (nf) - 2E (at) da (a2) 


= 1 
=4-2 f — du. 
0 Eolo 


Next, we show ||:|| p —>a>o0 ||:||,, pointwise for all t € [0,1]. Denote using 
Z and Z™, d € N, the generator processes of 7 and 7, d € N. Define 


SUPze [0,1] Zt 
ae 


m:= E| sup 4} <œ and Z= 
te [0,1] 


Then, E(Z) = 1, and thus, (Z:+, Z), define a generator of a D-norm ||-|| 5 on 


R? for all t € [0,1]. Lemma 4.3.5 implies ĉ{® < mŽ for all d € N. Therefore, 
for arbitrary x,y € R, d € N and t € [0,1], we have 


max (lel Zt, |y| 21°) < max G Zt, |my| 2) ; 
where 7 
E (max (|| Za myl Z:)) = lœ m)l < 00. 
Hence, we can apply the dominated convergence theorem to the sequence 


max G Zt, |y| ae), d € N. Together with the fact that Zo dsc Zr for 
all ¢ € [0,1] by Theorem 4.3.6, we obtain for x,y € R 


Ie, Ilo = E (max (lel Za yl 20) ) 
a> E (max (l2| Ze, ly) Zi) = l)la 


In Example 2.5.3, we already calculated [5° ||(1, wlz du = 2. Since ||la 
is the smallest D-norm, we have, for all d € N and t € [0,1], 
1 e 1 
Iolo ~ 1G, wl 


therefore, by applying the dominated convergence theorem again, we obtain 


that - i >% i 
—— du > d—00 | Faa du = 2, 
J Ia olipa 0 ewl 


which completes the proof. 


The generalized max-linear model, as considered in this section for SMS 
processes, carries over to GPP in a straightforward manner. For details, we 
refer the reader to Falk et al. (2015). 


® 


Check for 
updates 
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Further Applications of D-Norms 
to Probability & Statistics 


5.1 Max-Characteristic Function 


This section introduces max-characteristic functions (max-CF's), which are 
an offspring of D-norms. A max-CF characterizes the distribution of an rv in 
IR¢, whose components are non-negative and have finite expectation. Pointwise 
convergence of a max-CF is shown to be equivalent to convergence with respect 
to the Wasserstein metric. An inversion formula for max-CF is established as 
well. 

As discussed in Section 1.2, neither the generator of a D-norm nor its 
distribution is uniquely determined. However, given a generator Z of a D- 
norm on Rt, we can design a D-norm on R¢*! in a simple fashion so that it 
characterizes the distribution of Z: consider the D-norm on R¢+1 


|t, 2) || 5 := E (max(|t|,|zı| 21,...,|ea| Za), tER, 2 € Rê. 


Then, it turns out that the knowledge of this D-norm fully identifies the 
distribution of Z; it is actually enough to know this D-norm when t = 1, as 
the following Lemma 5.1.1 shows, and this shall be the basis for our definition 
of a max-CF. By =p we mean equality of the distributions. 


Lemma 5.1.1 Let X = (Xı,..., Xa) > 0, Y = (M%1,.-., Yu) > 0 be 
rus with E(X;), E(Y;) < œ, 1 < i < d. If we have, for each æ > 0 € R$, 


E (max(1, z1Xı,.. ., £a Xa)) = E (max(1,271¥%1,...,x2aYa)), 


then X =p Y. 


Proof. From Lemma 1.2.2, for arbitrary 2 > 0 € R and c > 0, we obtain the 
equation 
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X X gi X X 
E (max (1,4....,24)) =| 1 = P (max (1, 24,..., 4) <+) dt 
C£1 CX 0 CL CL 
0 
1 
The substitution t +> t/c yields that the right-hand side above equals 
1 f> ; 
c E 


Repeating the preceding arguments with Y; in place of X;, we obtain, for all 
c > 0 from the assumption in Lemma 5.1.1, the equality 


J -PÆ stsi<soa= | 1— P(Y; < tz, 1 <i < d)dt. 


Taking right derivatives with respect to c, we obtain, for c > 0, 


1 — P(X; < cx, 1 <i <d) =1-— P(Y; < cx; 1<i<d) 


and, thus, the assertion. 


The following characterization extends Lemma 5.1.1. 


Lemma 5.1.2 Let X =(X1,...,Xa) 2 0, Y = (Y1, ..., Ya) > 0 be rus 
with finite expectations in each component. Suppose there is a number 
a €R such that, for all x > 0 € R$, 


E (max (1,21.X1,...¢aXa)) = E (max(a,2iVi,...,2aYa)). (5-1) 


Then, a = 1 and X =p Y. 


Proof. For a < 0 and æ > 0 € R?, we clearly have 
E (max (a, £1Y1,..., £aYa)) = E (max (#1¥1,...,vaYa)). 


Thus, we can suppose a > 0. Multiplying both sides of equation (5.1) with an 
arbitrary number c > 0, we obtain the working assumption 


Pe, x (£) := E (max(c, £t1X1,... , £4 Xa)) 
= E (max(ac, £1Y1,.. . , £aYa)) 


=: Yea y (2), c>0,2>0€R?. (5.2) 
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Now, according to Lemma 1.2.2, we have, for any c > 0 and any æ > 0, 


1 cae X X 
vox (4) = f 1- P (max (e,24,...,24) <1) dt 
x 0 ry Ld 


=c+ f 1— P(X; < taj, 1 < i < d) dt. 
By the same arguments, we obtain 
1 = : 
Yea,¥ (=) = a+ | 1— P(Y; < taj, 1< i< d)dt. 


If a = 0, the latter identity becomes 
1 we x 
Po,Y (=) =|} 1— P(Y; ig 10) ae, 
x 0 
so that equation (5.2) becomes 
c+ f 1— P(X; < ta, i<ixaa= f 1— P(Y; < taj, 1<i<d)dt 
K 0 


for each c > 0 and x > 0 € Rĉ. Taking right derivatives with respect to c 
yields 
Letting c — oo clearly produces a contradiction. 

Suppose next that a > 0. We have 


+ f I= P(X; <tr 1S i <a) dt= cat f 1— P(Y; < tx, 1 <i< d)dt. 


a 
Taking right derivatives with respect to c again entails 


Letting c + co gives 1 = a — 1 + 1 and, therefore, a = 1. Our basic as- 
sumption is, thus, equivalent with the one in Lemma 5.1.1, which implies 
X =pY. 


DEFINITION OF MAX-CHARACTERISTIC FUNCTION 


Let Z = (Z1,..., Za) be an rv whose components are non-negative and inte- 
grable. Then, we call 


pz(x):= E (max (1, £1Z1,...,Z4Za)), 


with {x = (x1,..., £4) > 0 € Rt, the maz-characteristic function pertain- 
ing to Z. Lemma 5.1.1 shows that the distribution of a non-negative and 
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integrable rv Z is uniquely determined by its max-CF. Note: although we 
use the notation Z for a non-negative rv here and in what follows, we do 
not require it to be the generator of a D-norm, i.e., we do not require that 
its components have expectation one. We only require them to have finite 
expectation. 

The definition of a max-CF can be extended to arbitrary, not necessarily 
non-negative rvs X = (X1,...,Xa) by putting Z := (exp(X)),...,exp(Xa)), 
provided that E(exp(X;)) < œ, 1 < i < d. The distribution of Z clearly iden- 
tifies that of X. A prominent example is an rv X that follows a multivariate 
normal distribution. In this case, each component Z; = exp(X;) is log-normal 
distributed; see also the definition of the Hiisler-Reiss D-norm in (1.6). 


BASIC PROPERTIES OF MAXx-CF 


Some obvious properties of yz are yz(0) = 1, yz(a) > 1 for all x, and, with 
r € [0, 00), 


We will show next that any max-CF is a convex function; thus, it is continu- 
ous and differentiable almost everywhere (see, for example, Rockafellar (1970, 
Theorem 25.5)); furthermore, its derivative from the right exists everywhere. 
This fact is used in Proposition 5.1.18, where we establish an inversion formula 
for max-CFs. 

When Z has bounded components, we obviously have yz(#) = 1 ina 
neighborhood of the origin by the definition of yz(a). 


Lemma 5.1.3 Any maz-CF yz is a convex function. 


Proof. Let pz be the max-CF of the rv Z = (Z1,..., Za) on R4. We can 
assume wlog that no component Z; is a.s. equal to zero. We have to show 
that, for any A € [0,1], 


pz(r@ + (1—A)y) < Ava (a) + (1— A)va(y), z,y>O0eER?. 


Put 
X := (Xo, ---, Xa) = (1,4%,...,Za). 


A repetition of the arguments in the proof of Lemma 1.1.3 yields that 
p y. = d+1 
ell = E (gx (ln) Xi), E= (eosa) ERM, 


defines a norm on R¢*?; but ||-||ẹ is not necessarily a D-norm, as we only 
require E(X;) € (0,00) for i =1,...,d. 

Each norm ||-|| is a convex function, which is an obvious consequence of 
the triangle inequality and its homogeneity: 
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làs + (1 — A)yl] < [Axl] + 1. — Adal 
=Allel|+(—A)|lyll, A € [0,1]. 


As a consequence, we obtain, for x,y > 0 € R? and A € [0,1], 


pz (Aw + (1 — A)y) 

= E( max (1, (Ari + @ = MeFi Oey HA \)ya)Za)) 

5 E( max (à +(1-A), Azi = A Fig Oey \)ya)Za) ) 
sjati Ae P= dying eg A Aa 


SAIC £1- ta) +A = A) A y+ ya)llx 


= Apa (x) + (1 — A)paly), 


which proves the convexity of yz. 


Lemma 5.1.4 The set of max-CFs is convex, i.e., if p1 and p2 are 
maz-CF on [0,00)4, then 


yy := Agi + (1 = A)p2 


is a maz-CF for each A € [0,1]. 


The proof of Lemma 5.1.4 repeats the arguments in the proof of Propo- 
sition 1.4.1, which states that the set of D-norms is convex. It provides in 
particular an rv Z), whose max-CF is given by px. 


Proof. Let Z, Z) be rvs with corresponding max-CFs y1, p2. Take an rv 
€ that attains only the values one and two, with probability P(E = 1) = À = 
1 — P(€ = 2), and suppose that € is independent of Z®) and of Z). Note 


that | i 
ge) .— (22, — 2) 


is an rv in R¢, whose components are non-negative and have finite expectation. 
Precisely, for © = (x1,..., £4) > 0, we obtain 


E (max (Lazh, = eaZ) ) 


Il 
Me 


Il 
m 


E (max (paan, ps ,2aZ) 1(€ = i)) 


a 


II 
Me 


Il 
un 


E (max (1,220, an a) 1(é= i) 
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II 
Me 


Il 
m 


E (max (1,221, = eazi) | E(1(é =i) 


a 


II 
Ms 


E (max (1.2.2), ih ,2aZQ)) P(€ =i) 


Api (x) + (1 — A)ya(z), 


which completes the proof. 


©. 
Il 


THE MAX-CF IN THE UNIVARIATE CASE 


When d = 1, the max-CF of a non-negative and integrable rv Z is, according 
to Lemma 1.2.2, 


vpz(x) = E (max (1, xZ)) 


=1+ f P (zZ >t) dt 
1 


=1+2/ P(Z >t) dt 
1/ax 

=1+2E((Z — 1/x)1(Z > 1/2)). 
The latter expression is connected to the expected shortfall of Z (see Embrechts 
et al. (1997)). Indeed, if gz(u) := inf {t > 0: P(Z < t) > u}, u € (0,1), is the 
quantile function of Z, then the expected shortfall of Z for a € (0,1) is defined 
by 
E(Z1(Z > qz(a))) 


ESz(a) := E(Z | Z > qz(a)) = P(Z > qz(a)) 


Putting £a := 1/qz(a), we obtain 
ga(ta) = 1+ aE ((Z — qz(a))1(Z > qz(a))) 


= 1+zaP(Z > qz(a))(ESz(a) — qz(a)) 
= 1 + £aSP z(a), 


where 
SPz(a) = P(Z > qz(a))\(ESz (a) — qz(a)) 


is the stop-loss premium risk measure of Z; see Embrechts et al. (1997). 

The preceding remarks suggest that max-CFs might be closely connected 
to well-known elementary objects such as conditional expectations and risk 
measures; a particular consequence of it is that computing a max-CF may, 
in certain cases, be much easier than computing a standard characteristic 
function (CF), i.e., a Fourier transform. The following example illustrates 
this idea. 
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Example 5.1.5 Let Z be an rv that has the GPD with location pa- 
rameter u > 0, scale parameter o > 0 and shape parameter € € (0,1), 
whose df is 


at: 
PZ sa=1-(1+62—4) een 


The expression of the CF of this distribution is a fairly involved one, 

which depends on the Gamma function evaluated in the complex plane. 
However, it is straightforward to show that, for all z > 0, 

oO ; 

E(Z)-z=p-— IË if 2 <h, 

i) P(Z > z2) dz= 

x o 


i 
Hence, the max-CF of Z is 


The next example is a consequence of the Pickands—de Haan—Resnick rep- 
resentation of a max-stable df; see Theorem 2.3.4. 


Example 5.1.6 Let G be a d-dimensional max-stable df with identical 
univariate Fréchet margins G;(a) = exp(—a2~%), x > 0, a > 1. Then, 
there exists a D-norm ||-||,, on R? such that G(x) = exp (— ||1/2||p), 
x > 0 € Rt. The max-CF yg of G is, for æ > 0 € R, 


xel 
p(o) a 
y 


Sie / een oe 


1/a 
1/||e || 


CHARACTERIZING POINTWISE CONVERGENCE OF MAX-CFS 


Denote using dw (P,Q) the Wasserstein metric between two probability dis- 
tributions on R? with finite first moments; see Section 1.8. Recall that conver- 
gence of probability measures P,, to Po with respect to the Wasserstein metric 
is equivalent to weak convergence together with convergence of the moments 
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[lees Palda) trace f lel Polda); 
R4 R4 


see, for example, Villani (2009). Let X and Y be integrable rvs in R? with 
distributions P and Q. Using dw (X,Y) := dw (P,Q) we denote the Wasser- 
stein distance between X and Y. The next result precisely says that point- 
wise convergence of max-CFs is equivalent to convergence with respect to the 
Wasserstein metric. 


Theorem 5.1.7 Let Z, Z™, n € N, be non-negative and integrable 
rus in R! with the pertaining max-CFs pz, pzm, n € N. Then 


Yuin) >n=>œ Yz pointwise iff dw (Z™, Z) >n l. 


Proof. Suppose that dw(Z™ , Z) nso 0. Then, we can find versions Z™ 
and Z with EF (|Z — Z|) >n>o 0. For æ = (z1, ..., Za) > 0, this implies 


Pz (£) = E(max(1, A = Z)),...,ea(Za = Za)))) 


< E (max(1, £121,- .. , £aZa)) + llæll o E (|Z — Z||,) 
> E(max(1,712,...,¢aZa)) — |læll £ (|Z — Z|) 


= yz(x) + o(1). 


Suppose next that yam) —n=>>œ Yz pointwise. For t > 0 and z = 
(£1; ..., £a) > 0, we have 


toza) (=) =+ f 1- P (zZ <y,1 <i<d) dy, 
t 


tez (=) =t+ f 1— P(2:Zi<y,1 <i <d) dy. (5.3) 
t 


Putting t = £ and æ = e;, for € > 0 and 1 <i < d, these equations imply 


E (z) = fo 1-P(2 <y) dy 
0 
= f 1-P(2r <y) dy + O(e) 
sno | 1- P(Z: <v) dy + Ole) 
= E(Z;) + Ole). 


Letting £ tend to zero entails convergence of E (a) to E (Z;). Therefore, 


we have to establish weak convergence of Z) to Z. From equation (5.3), we 
obtain, for 0 < s < t and æ = (z1,..., £a) 2 0, 
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x x i (n) ' 
tp zn) (=) — S~Z(n) (=) = P (zz! <yl<i< d) dy 


s 
T T 
noo tpz (=) — SPZ (=) 


t 
= | P (xiZi < y,1 <i < d) dy. (5.4) 


Let x = (a1,...,2a) > 0 be a point of continuity of the df of Z. Suppose 
first that x > 0. Then, we have 


1 
p (2) <2) =P(L2 <11sisa). 
Ti 


If 


4 


lim sup P (Hz <1,1<i< ‘) > p(+z <1,1<i< a) 
x Ti 


i 2&4 Se 


tte l 1 , 
tmint P (+2 <1 1sisd) <P(Last1sisd), 
Ti 


then equation (5.4) readily produces a contradiction by putting s = 1 and 
t=1+e, ort =1 and s= 1 — e€ with a small € > 0. Thus, we have 


P (z < z) >na P(Z < x) (5.5) 


for each point of continuity £x = (£1,..., £4) of the df of Z with strictly 
positive components. 

Suppose next that z; = 0 for j € T C {1,...,d}, x; > 0 fori g T, T #4. 
In this case, we have 


by the assumed continuity from the left of the df of Z at x. Thus, we have to 
establish 


lim sup P (z™ < x) = lim sup P (a <a,i¢T; 2 <0,j5€ T) = 0) 
Suppose that 
lim sup P (z <enigT,Z™ <0, je T) =c>0. 
Choose a point of continuity y > a. Then, we obtain 
0<c<limsup P (zm < y) =P(Z< y) 
n= 


by equation (5.5). Letting y converge to x, we obtain P(Z < <x) > c > 0 and, 
thus, a contradiction. This completes the proof of Theorem 5.1.7. 


196 5 Farther Applications of D-Norms to Probability & Statistics 
SOME APPLICATIONS TO MULTIVARIATE EXTREMES 


Convergence of a sequence of max-CFs is therefore stronger than the conver- 
gence of standard CFs: choose a sequence of real-valued rvs Zn, n € N, such 
that 


P(Zn =e") =4 and Pa OS, neN. 
n n 


Then, Zn converges to zero in probability and, therefore, in distribution, but 
the sequence of moments does not converge to zero as well, as E(Z,) = 
e” /n >n OO. 

The following Corollary 5.1.8, which is obtained by simply rewriting The- 
orem 5.1.7, is tailored to applications in multivariate extreme value theory. 


Corollary 5.1.8 Let X™, n € N, be independent copies of an rv 
X in RÊ that is non-negative and integrable in each component. Let 
€ = (&1,...,ĉ4) be a maz-stable rv with Fréchet margins P(&; < x) = 
exp (—1/a“), x >0, a; >1,1<i<d. Then, from Theorem 5.1.7, we 
obtain the equivalence 


dw pss < £) noo 0 


a™ 
for some norming sequence 0 < a™ e RI iff 


Pn n> PE pointwise, 


where Yn denotes the max-CF of maxi<i<n XO /a™, neN. 


The following example shows a nice application of the use of max-CFs 
to the convergence of the componentwise maxima of independent generalized 
Pareto rvs. 


Example 5.1.9 Let U be an rv that is uniformly distributed on (0, 1), 

and let Z = (Z),..., Za) be the generator of a D-norm ||-||, with the 

additional property that each Z; is bounded, i.e., Zi < c, 1 < i < d, for 

some constant c > 1. We require that U and Z are independent. 
Then, the rv 


1 a fat 
VeVi va = oe (41 7, ) 


with a > 0 follows a multivariate GPD; see Section 2.2. Precisely, for 
x > (ct/%,...,c!/*) € R4, we have 


5.1 Max-Characteristic Function 
Now, let V™, V@),... be independent copies of V and put 


Y™ = n1 max VO, 


l<i<n 


Then, we have, for x > 0 € R? and n large, 


eeose- (l 
D 


—n—-oco EXD (- zo 
D 


= P(E < x), (5.6) 


where € is a max-stable rv with identical Fréchet margins P(é; < x) = 
exp(—1/x%“), x > 0. Choose a > 1; in this case, the components of V 
and € have finite expectations. By writing 


oe 1 
Pym) (#) =1 +f ee a (vo < iz) dt 
i! 


and using equation (5.6), elementary arguments, such as a Taylor ex- 
pansion, make it possible to show that the sequence of max-CFs yy in) 
converges pointwise to the max-CF yẹ of €. Since convergence with 
respect to the Wasserstein metric is equivalent to convergence in dis- 
tribution together with convergence of the moments, we obtain from 
Theorem 5.1.7 that, in this example, we actually have both Y —>p € 


and E Ca Soe BG) = Wd —1/e) tr i <i<ad 


Example 5.1.10 Let U“,U®),... be independent copies of the rv 
U = (U,,...,Ua), which follows a copula on R%, i.e., each U; is uni- 
formly distributed on (0,1). From Proposition 3.1.10, we know that 
there exists an SMS rv n = (m,...,7a) in Rt such that 


VM:=n ( max UY) — 1) >p 


1<j<n 


iff there exists a D-norm ||:||p on R, such that for æ < 0 € R? 
P(V™ <a) neo exp (—|letllp) =: G(@), 


or iff there exists a D-norm ||:||p on R? such that 
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Clu) =1-||1 -ullo + 0(|1 — ull) 


as u— 1, uniformly for u € [0, 1]¢. 
For 1 < i < d, we have convergence of the first moments 


z (3) J i Sar alpen 
B(n(1 an )) =H pete 1 = BC mi), 


thus, we obtain from Theorem 5.1.7 the characterization 


V™ spn 
n (v™,n) Be a 


<= P yín) nœ Y-n pointwise. 


For instance, when d = 2, straightforward computations yield that —7 
is a weak limit above iff it has a max-CF of the form 


p-n(x) 
i 


= 1+ 2 exp(—1/21) + x2 exp(—1/2#2) — Walp exp(— ||1/2l|p)- 


CONVERGENCE OF D-NoRMS, THEIR GENERATORS, AND 
MAaAx-CFs 


The following consequences of Theorem 5.1.7 supplement Proposition 1.8.3, 
which is formulated for generators of D-norms, whose components add up to 
the constant d. Now, we drop this condition. 


Corollary 5.1.11 Let Z, Z™, n € N be generators of D-norms on 
Ri. Then Pain) Sn 0 Yz pointwise iff Zap Z. 


Interestingly, the convergence of a sequence of max-CFs of generators of D- 
norms also implies pointwise convergence of the related D-norms. We denote 
by |l-||p,z that D-norm generated by Z. 


Corollary 5.1.12 Let Z, Z™ , n €N be generators of D-norms in R¢ 
with pertaining max-CFs pz, Pz; n E€ N. Then, pointwise conver- 


gence Yzin) >n YZ implies ||-|| p gin) >n=> ||| p, z pointwise. 


5.1 Max-Characteristic Function 199 


Proof. For æ > 0 and proper versions of Z™) and Z, we have 


lælpm =F (max, (2.2{”) 
= (a, (2:2; + zi(Z® - 2))) 
=E (nas (2) +0 (E (2 E z|) 


1<i<d 


no E ( max (esZ:)) = lel 


The following consequence of Corollaries 5.1.11 and 5.1.12 is obvious. 


Corollary 5.1.13 Let Z, Z™ be arbitrary generators of the D- 
norms |illp Illo, on R, n EN. f Z™ >p Z, then ||-|p, n>% 


IIIb, pointwise. 


The reverse implication in the preceding result is not true; just put Z) := 
(1,...,1) € R? and Z™ := (X,...,X), where X > 0 is an rv with E(X) = 0. 
Both generate the sup-norm ||-||,,, but, clearly, Z (2) Ay soo Z®, unless 
X=las. 


A CENTRAL LIMIT THEOREM FOR D-NORMS 


The following example may be viewed as a central limit theorem for D- 
norms. It is closely related to the multiplication stability of a Htisler—Reiss 
D-norm in Example 1.9.2; the multiplication of D-norms is introduced in 
Section 1.9. 


Example 5.1.14 Let Y = (%,..., Ya) be an rv such that E(Y;) = 0 
and 0 < u; := E(exp(¥;)) < co for 1 <i < d. Then, 


Ze pom na) 


prey 


Hı Ha 
= (exp(Yi — log(u1)), . - - ,exp(Y4 — log(ua))) 


is the generator of a D-norm. 

Suppose that E (Y?) < œ, 1 < i < d, and let Y®, Y,... be 
independent copies of Y. Then, the multivariate central limit theorem 
is applicable, i.e., 


E 
Ta YoY >p N(O,5), 


j=1 
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with covariance matrix X = (04; )1<i,j,<a = (E(ViY;))1<i,j,<a. The rv 
exp (Em) exp oa 

E (exp Oaa oe (exp (vs? /vn)) 


with 1 < j < n and n € N, defines a triangular array of generators of 
D-norms ||: IlDjn that are identical in each row, i.e., ||- la =: |llp,; 
LSJ ann © N. Note that Zine Zau ole araa rvs for 
each n > 2, and thus, 


z™ = Il Ze 
j=l 


(a[i mee (cw 28)))} 
EA) 


Zijn = 


generates the product D-norm ||-|| pr: Note that E(exp(Y/V/n)) < co 
for n € N if E(exp(Y)) < œ. 

Using the Taylor expansion exp(x) = 1 + 2 + exp(vx)xz?/2, with 
some 0 < ù < 1, x € R, and log(1 + £) = £ + O(e?) as € > 0, it is easy 
to see that 


ou $) 


The multivariate central limit theorem, together with the continuous 
mapping theorem, implies 


A A (xı- m ey = m) : (5.7) 


where the rv X = (Xj,...,Xq) follows the multivariate normal distri- 
bution N(0, X). 

The norm, which is generated by Z), is the Hiisler-Reiss D-norm 
|l- lur, introduced in (1.6). As a consequence, we obtain from (5.7) and 
Corollary 5.1.13 the pointwise convergence of the product of D-norms 


lllo n00 Illar - 
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BOUNDING A MAX-CF By THE D-NORM 


The following lemma relates a D-norm to the max-CF of its generator. 


Lemma 5.1.15 If pz is the maz-CF of a generator Z of a D-norm 
II'llp.z» then for all x € [0, o0)? 


max(1, ||2\|p,z) < pz(æ) < 1+ |lellp,z- 


Especially if Z denotes the set of all generators of D-norms, then 


xz 
ae pz (x) 


—1|—>0 forze [0, o0)? with ||x\|,, + 0. 
Zez lællp z 


Proof. The lower bound is obtained by noting that 
1 < max(1, z1 Z1, ..., £aZa) 
and 
max(x1Z1,...,@a@Za) < max(1,271Z1,...,aZa) 


and taking expectations. The upper bound is a consequence of the inequality 
max(a,b) < a+b, valid when a,b > 0. Finally, the uniform convergence result 
is obtained by writing 


1 
< pz (x) Z 
|z||p,z |z||p,z 


for all Z € Z and all x € R4 \ {0}. Because ||- ||p,z > || - llo, this entails 


sup 
ZEZ 


pz(æ) _ |< = 
lællD,z llælloo 


from which the conclusion follows. 


It is worth noting that the inequalities of Lemma 5.1.15 are sharp in the 
sense that, for Z = (1,...,1), pz(x) = max(1,||x||..) = max(1, ||æ||D,z). 
Therefore, the leftmost inequality is in fact an equality in this case, whereas 
the rightmost inequality yz(x) < a+ b||æ||D,z can only be true if a,b > 1 
because of the leftmost inequality. 

Lemma 5.1.15 has the following consequence. 


Corollary 5.1.16 No constant function can be the maz-CF of a gen- 
erator of a D-norm. 


Such a result is, of course, not true for standard max-CFs, since, for in- 
stance, the max-CF of the constant rv zero is the constant function one. The 
next result supplements Lemma 5.1.15. 
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Lemma 5.1.17 Let Z =(Zj,..., Za) be a generator of a D-norm ||-|| p 
on R¢, and let pz be its maz-CF. Then, we obtain 


Jim (pa(z) - |iælp) = P(Z = 0), 


where the convergence x — œ means that each component x; of x 
converges to infinity. 


Proof. From Lemma 1.2.2, we obtain for æ = (@1,...,%a) > 0 


ya(x) — ||| p 
= E(max(1,714,...,%aZa)) — E (suax (0.2) ) 


= h P(max(1, £x1Z1,...,£d4Za) > t) dt -f P ( max (x:Z;) > t) dt 
0 Sts 


= P(Z =0) 


by the dominated convergence theorem. 


AN INVERSION FORMULA FOR THE MAx-CF 
It is clear from the bound 
lælp z < 92(@) < 1+ |lællp z, z>0eR’, 


in Lemma 5.1.15 that the D-norm ||:||p z can be deduced from yz, since 


i pa (tx) 
3o ae 


3 = lælp, z, z>O0eR?. 
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In the next result, we establish a direct inversion formula for a non-negative 
and componentwise integrable rv, not necessarily the generator of a D-norm. 

We have seen in Lemma 5.1.3 that each max-CF is a convex function; 
thus, it is continuous and differentiable almost everywhere (see, for example, 
Rockafellar (1970, Theorem 25.5)); besides, its derivative from the right exists 
everywhere. 


The next result contains both an inversion formula for a max-CF and a 
criterion for a function to be a max-CF. 


Proposition 5.1.18 Let Z be a non-negative and integrable rv with 
maiz-CF pz. 


(i) For all x = (a1,...,2a) > 0, we have 


P(Z <x) = (toz (=)) mi 
= mo (a +h)pz (a) ae ©) 


where 04 denotes the right derivative. 


(it) Ify is a continuously differentiable function, such that for all x = 
(Dieses BH) >0 


a (=) 


lim t 
t= oo 


. 1 
jim t el —1)})=0, x = (Tiye ta) > 0, 


for any max-CF y. 


Proof. Notice first that, for all t > 0, 


1 Zı Za 
— |] =tE T 
i (=) ' (max ( try" )) 
Z Z 
= E (max (12, E) 
Tı Tad 
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This gives 
1 z Z Z 
toz (4) -| P (max (1,4...) >y) dy 
tx 0 Tı Ta 
=t+ f P (max (£...) > y) ay 
t Ly Ld 


=t f 1— P(Z; < yz£j, 1 < j < d) dy. 
t 


This representation yields in particular that limy_,. t(gz(1/(tx)) — 1) = 0. 
To show (i), notice that taking right derivatives with respect to t yields 


om 1 . 
—— — — ` < 5 < < . 
At (toz (=)) P(Z; <ta;, 1<j<d) 


Setting t = 1 concludes the proof of (i). To prove (ii), note that, for all t > 0, 


d 
o 1 1 1 1 1 
— — E —]-- —d;v | — 5.8 
ðt (w(4)) »(Z) ag iv (+). ey 
where 0;7 denotes the partial derivative of w with respect to its jth compo- 
nent. In particular, because 


o 1 
P(Z; < £j, i<i<d=5(w(Z)) i 


SOG) es 


i=1 “9 


we obtain from equation (5.8), by replacing x with ta in equation (5.9), that, 
for all t > 0 


Now write 


to conclude the proof of (ii). 
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Example 5.1.19 Let G be a d-dimensional max-stable df with iden- 
tical univariate Fréchet margins G;(a) = exp(—a~°*), x > 0, a > 1. 
According to Example 5.1.6, the max-CF of G is given by 


palz) =1 +4 |la™||7/° T, Te 1—exp(-y™*)dy, 2 >0€R’, 
1/|læ°Ħ |5 


with some D-norm ||:||p on R?. The inversion formula immediately 


3: (e (a)l eel) 


for x > 0 € R?. 


For additional results on max-CFs, we refer to Falk and Stupfler (2017). 


5.2 Multivariate Order Statistics: The Intermediate Case 


Asymptotic normality of intermediate order statistics, which are taken from 
univariate iid rvs, is well known. We generalize this result to rvs in arbitrary 
dimensions, where the order statistics are taken componentwise. D-norms turn 
out to be quite helpful again. 


INTRODUCING MULTIVARIATE ORDER STATISTICS 


Let X® = (K raa) XM = aX) be independent 
copies of an rv X = (X,,...,Xq) that realizes in R. Using 


Xi:n,i < Xani < ma < Xn:n,i 


we denote the ordered values of the i-th components of X@),...,X(,1< 
i < d. Then, (Xj,:n1,---,Xjgn,a), with 1 < j1,...,ja < n, is an rv of order 
statistics (os) in each component. We call it a multivariate os. 

The univariate case d = 1 is, clearly, well investigated; standard refer- 
ences are the books by David (1981); Reiss (1989); Galambos (1987); David 
and Nagaraja (2005); Arnold et al. (2008), among others. In the multivariate 
case d > 2, the focus has been on the investigation of the rvs of component- 
wise maxima (Xn:n,1;- -< , Xn:n,a); see Chapter 2 and 3 of this book and the 
references given therein. 

Much less is known in the extremal case (Xn—k:n,1,---;Xn—kg:n,d) With 
ki,...,ka € N fixed; one reference is Galambos (1975). Asymptotic normality 
of the rv (Xj,:n,1,---;Xjq:n,a) in the case of central os is established in Reiss 
(1989, Theorem 7.1.2). In this case, the indices j; = j;(n) depend on n and 
have to satisfy j;(n)/n >n» qi €(0,1),1<i<d. 


206 5 Further Applications of D-Norms to Probability & Statistics 
INTERMEDIATE ORDER STATISTICS 


In the case of intermediate os, we require ji = ji(n) = n — ki, where k; = 
k(n) >n 00 with ki/n >n 0. Asymptotic normality of intermediate os 
in the univariate case under fairly general von Mises conditions was established 
in Falk (1989). Balkema and de Haan (1978a) and Balkema and de Haan 
(1978b, Theorem 7.1) proved that for particular underlying df F, Xn-k+1:n 
may have any limiting distribution if it is suitably standardized and if the 
sequence k is chosen appropriately. 

As pointed out by Smirnov (1967), a non-degenerate limiting distribution 
of Xn—k+1:n, different from the normal one, can only occur if k has an exact 
preassigned asymptotic behavior. Assuming only k no 00, k/n n-y00 
0, Smirnov (1967) gave necessary and sufficient conditions for F such that 
Xn-k+1:n is asymptotically normal, and he specified the appropriate norming 
constants; see condition (5.14) below. 

Smirnov’s result was extended to multivariate intermediate os by Cheng 
et al. (1997). They identify the class of limiting distributions of the rv 
(Xn—kinjly-++)Xn—kg:n,d) after suitable normalizing and centering, and give 
necessary and sufficient conditions for weak convergence. 

Cooil (1985) established multivariate extensions of the univariate case 
by considering vectors of intermediate os (Xn—k,41:n;---;Xn—kytiin) taken 
from the same sample of univariate os Xi:n < +- < Xn:n, but with pair- 
wise different ky,...,kq. Barakat (2001) investigated the limit distribution of 
bivariate os in all nine possible combinations of central, intermediate, and 
extreme os. 

According to Sklar’s theorem 3.1.1, the df of X = (Xj,...,Xqa) can be 
decomposed into a copula and the df F; of each component X;, 1 < i < d. 
We establish in what follows asymptotic normality of the vector of multivari- 
ate os (Xn—ky:n,1;--+;Xn—kg:n,d) in the intermediate case. This is achieved 
under the condition that the copula corresponding to X is in the max- 
domain of attraction of a multivariate extreme value df, together with the 
assumption that each univariate marginal df F; satisfies a von Mises con- 
dition and that the norming constants satisfy Smirnov’s condition (5.14) 
below. 


MAIN RESULTS: COPULA CASE 


We consider first the case that the df of the rv X is a copula, say C, on R4. 
We require C to be in the max-domain of attraction of an SMS df G, as in 
Section 3.1. In this case, according to Theorem 2.3.3, there exists a D-norm 
I|:|| 5 on R? such that 
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G(x) = exp (—||z||p), x <O0€R’. 
From Proposition 3.1.5, we know that C € D(G) is equivalent to the expansion 
C(u) =1-|[1— ally +0(l1— ul) (5.10) 


as u — 1, uniformly for u € [0, 1]?. 

We are now ready to state asymptotic normality of the vector of multivari- 
ate os in the intermediate case following a copula. The proof of Theorem 5.2.1 
is postponed. 


Theorem 5.2.1 (The Copula Case) Suppose that X = (X1,...,Xa) 
follows a copula C, which satisfies expansion (5.10) with some D-norm 
l-|p on R¢. Let k = k(n) = (ki,...,ha) € {1,...,.n—1}"%, n € N, 
satisfy ki/kj > ki, € (0,00) for all pairs of components 1 < i,j < d, 
|k|| > co and ||k|| /n + 0 as n — oo. Then, the rv of componentwise 
intermediate os is asymptotically normal: 


d 
n n— ki 
—— — N (0, x7 
vki n e \ Ms 


where the d x d-covariance matrix X is given by 


ii ifi=j, 
kij + kji = ||kijei + kjiejllp ; if 7 A J: 


B= (ou) =| 


If, for example, the underlying D-norm ||:||p is the logistic norm ||æ||p = 
lell, = (E2: leal?)'/, p > 1, then oj = hig + kj — (K8, + RR)? iZi. 

Note that oj; = 0, i Æ J, if |l-l|p = |l-I],, which is the case for inde- 
pendent margins of G(x) = exp(- ||æl|p) = IŁ, exp(z;), x < 0 € R°. 
Then, the components of X = (X,,...,Xq) are tail independent. The re- 
verse implication is true as well, i.e., the preceding result entails that the 


componentwise intermediate 08 Xn—k,:n,1,---;Xn—kgin,d are asymptotically 
independent iff they are pairwise asymptotically independent. But this is 
one of Takahashi’s characterizations of ||-||p = ||-||,; see Corollary 1.3.5 and 


Theorem 2.3.8. 

Note that oij > 0 for each pair i, j, i.e., the componentwise os are asymp- 
totically positively correlated. This is an obvious consequence of the fact that 
each D-norm ||:||p is pointwise less than ||-||,; see (1.4). 
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Corollary 5.2.2 If we choose identical k; in the preceding result, i.e., 
kı =--- = ka = k, then, under the conditions of Theorem 5.2.1, we 
obtain 


n—k\? 


+p N(0, X) 


aes ile ifi = j, 
= Oij = 
7% \2-le:+e;lp, ifi# j. 


Let Uyen < Uz2:n < +++ < Un:n denote the os of n independent and uniformly 
on the (0,1) distributed rvs U1,...,Un. It is well known that 


n Sm Ej 7 
(Unde =p (== , 
peas Ej i=l 


where Fy,..., En+1 are iid standard exponential rvs; see equation (2.29). 

Let €1,&,--.,&€2(n41) be iid standard normal distributed rvs. From the 
fact that (€? + €3) /2 follows the standard exponential distribution on (0, 00), 
we obtain the representation 


Head \ 
(Ung) =D O (5.11) 


Corollary 5.2.2 now opens up a way of tackling a multivariate extension of 
the above representation (5.11) at least partially and asymptotically. 


Corollary 5.2.3 Suppose that the d x d-matrix A with entries 


ee aly Es 
Ne eo hg ane 1/2 TA 7 
: e eral iti 4, 


is positive semidefinite and let €),€°),... be independent copies of 
the rv € = (&1,...,&a), which follows the normal distribution N(0, A) 


on R?. Then, under the conditions of Corollary 5.2.2, we obtain 


See Gol 
aed sane} <* 
reR@ 2(n+1 j 
i j= (£ ) aei 


Spasa 0: 
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Note that the univariate margins yi H (ef 2i oie (g Dy i= 
1,...,d, in the above result have identical distributions, due to equation 
(5.11). 

The d x d-matrix (Aj;) = (aij) is by Corollary 5.2.2 positive definite. 
However, if a matrix with non-negative entries is positive semidefinite, the 
matrix of the square roots of its entries is not necessarily positive semidefinite 
again. Take, for example, the 3 x 3-matrix 


10a 
A=|{0la 


aal 


This matrix is positive definite for a = 1/3'/?, but not for a = 1/3™4. This is 
the reason why we require the extra condition in Corollary 5.2.3 that the ma- 
trix A is positive semidefinite. The matrix A is, for example, positive semidef- 
inite if the value of ||e; + e;||p does not depend on the pair of indices i Æ j, 
in which case A satisfies the compound symmetry condition. 


Proof (of Corollary 5.2.3). From Corollary 5.2.2, we obtain that 


n n—k\4 
— (Kins — ) >D N(O, X). 


n i=1 


The assertion follows if we establish 


” Di k) ‘Gy J Pe 


Arena A 


d 


>D N(0, X) 
i=l 


as well. But this follows from the central limit theorem and elementary argu- 
ments, using the fact that Cov(X?, Y?) = 2c? if (X,Y) is bivariate normal 
with Cov( X,Y) =c 


THE PROOF OF THEOREM 5.2.1 


The proof of Theorem 5.2.1 requires a suitable multivariate central limit the- 
orem for arrays. To ease its reference we state it explicitly here. It follows 
from the univariate version based on Lindeberg’s condition, together with the 
Cramér—Wold device; see, for example, Billingsley (1999, 2012). Recall that 
all operations on vectors are meant componentwise. 
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Lemma 5.2.4 (Multivariate Central Limit Theorem for Arrays) 


Let Benen 60 be iid rvs with mean zero for each n € N, and 
bounded by some constant m = (m4,...,ma) > O € R¢. Suppose 
there is a sequence cl), n € N, in R? with nes” =>n>œ © for 
i=1,...,d, such that the covariance matriz of xP can be written as 
Cov (xt) = CM SOC n EN, where OM = diag ( am) and 
the matrices X™ , n € N, satisfy 2° >n X (meant element-wise). 
Then, 


1 


X® >p N(0, X). 
Fao n >p N(0, X) 


Proof (of Theorem 5.2.1). Choose æ = (x1,.. 
ments yield 


(Ge ens) oaz 


. £a) € R°. Elementary argu- 


j=1 
JE 2 (n 
d 
(Pe pne) <3) 
a i=1 
Now, put 


d 
y”) = (ri )..4¥5 ) = (1 (xe [08 a, += D) 
m i=1 


with values in {0,1}. The entries of its covariance matrix 5) = (ol) for 
i Æ j are given by 


=e (KO) -e (e) eC) 
) Pd =) = 


SP 


u 
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SẸ; — ki /k; — k; 
-P(x%< hya EE #)e (xs Ipp z) 
n n n 


n 


n 


k; — ki kj —k; 
= Cy (En , Xa, +t z) 


a 


; — k: k; des 
(n+ A) (Bn 8) 
n n n n 


if n is large, where 


Cijlu, v) = P(X; < u, X; < v) 
=C (1 — (1 — u)e; — (1 — v)ej), u,v € [0,1]. 


Expansion (5.10) now implies for the case i Æ j 


x P ~ ki k; 4/ k; 
oj b Rig. e, + |~2~—~ 2; |e; 
" n n san he 


n n 
i i \/k; k; \/ kik; 
- (Be ss)( tay Hot) sof E) 
n n n n 
ki ki o aSk 
-E - Ba) er (2- as) 
n n n 


D 
ki k; \/ kik; 
n n 
kik; 
= = J (kij + hye = ||kijei + kjiejllp + o(1)) : 


For i = j, one deduces 
off) = (1+ o(1)). 
n 


The asymptotic normality N (0, X)(—oo0, x] of the final term in equation 
(5.12) now follows from Lemma 5.2.4. 


MAIN RESULTS: GENERAL CASE 


Let F be a df on R? with univariate margins F,,..., Fa. From Sklar’s the- 
orem 3.1.1 we know that there exists a copula C on R? such that F(a) = 
C(F,(21),-.-,Fa(aa)) for each æ = (21,...,2a) € RË. 

Let X,X@),... be independent copies of the rv X, which follows this 
df F. We can assume the representation 
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X = (F7 '(U1),..., Fz Ua) 


where U = (U1,...,U4) follows the copula C and F7t(u) = inf{t € R : 
F(t) > u}, u € (0,1), is the generalized inverse of F;, 1 < i < d. Equally, we 
can assume the representation 


XO = (EO (UP e 0) ER 


where U“),U@),... are independent copies of U. 

Put w(F;) := sup{# € R: Fi(x) <1} € (—o0, oo], which is the upper 
endpoint of the support of F;, and suppose that the derivative F/ = f; exists 
and is positive throughout some left neighborhood of w(F;). Let k; = ki(n) € 
{1,...,n — 1} satisfy ki Sno 00, ki/n —>n—=o 0. It follows from Falk (1989, 
Theorem 2.1) that under appropriate von Mises-type conditions on F;, stated 
below, we have convergence in distribution 


Ky huand = dni 


Cni 


>D N (0, 1) 


for any sequences Cni > 0, dni € R that satisfy 


ni $ dni g bni 
lim ĈŻ =1 and lim Ti 9, (5.13) 
n— oo Ani Noo Ani 
where 
ki vki ; 
bni := F! 1-—), ani = —-~, 1<i<d. 
n n filbni) 


Theorem 1 of Smirnov (1967) shows that the distribution of cy !(Xn—k,:n— 
dn) converges weakly to N(0,1) for some choices of constants cn > 0, dn € R, 
iff for any x € R 

ki + n(Fj (cnx + dn) — 1) 


VON MISES-TYPE CONDITIONS 


Next, we state the three von Mises-type conditions, under which we have 
asymptotic normality for intermediate multivariate os in the general case: 
w(Fi) € (—00, co] and 


fila) [2° 1 — F(t) at 


lim ek von Mises (1 
wn) =e)? l 0) 
w(F;) = co and there exists a; > 0 such that 
im aS = ai; (von Mises (2)) 
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w(F;) < co and there exists a; > 0 such that 


(w(Fi) = x) fila) 


lim = Qi. von Mises (3 


The standard normal df, as well as the standard exponential df, satisfies con- 
dition (1); the Pareto df Fy(#) = 1-—2~*%, x > 1, a > 0, satisfies condition 
(2), and the triangular df on (—1, 1), with density f(x) = 1-|z|, x € (—1,1), 
satisfies condition (3) with a = 2, for example. For a discussion of these well- 
studied and general conditions, each of which ensures that F; is in the domain 
of attraction of a univariate EVD, see, for example, Falk (1989). 


ASYMPTOTIC NORMALITY: THE GENERAL CASE 


The following generalization of Theorem 5.2.1 can now easily be established. 


Proposition 5.2.5 Let the rv X have df F. Suppose that the copula C 
of F satisfies condition (5.10), i.e., C is in the max-domain of attraction 
of an SMS df, and suppose that each univariate margin F; of F satisfies 
one of the von Mises-type conditions (1), (2), or (3). 

Letk=k™ e {1,...,n- DE n E€ N, satisfy ki/kj >n=oo k? E 
(0,00) for all pairs of components i,j = 1,...,d, ||k]| >n=>œ œ and 
|k|| /n >n=>æ 0. Then, the vector of intermediate multivariate os sat- 
isfies 


Xas int Ui i 
—— | >p N (0,5), 


Cni i=1 


with X as in Theorem 5.2.1, for any sequences Cni > 0, dni € R, that 
satisfy (5.13). 


Proof. For x = (x1,..., £4) € RI, we have 
p (Zeini h <n, 1 <i ed) 
Cni 


=P (Ert (Un—k;:n,i) < Cniti + dni, 1 <i< d) 
38 a e ee) 


The assertion is now a consequence of Theorem 5.2.1 and Smirnov’s condition 
(5.14). 


,1<i<d), 
n 


5.3 Multivariate Records and Champions 


Records among a sequence of iid rv X“), X®),... on the real line have been 
investigated extensively over the past few decades. A record is defined as an 


214 5 Further Applications of D-Norms to Probability & Statistics 


rv X™ such that X > max(X,...,X@—). Trying to generalize this 
concept for the case of random vectors, or even stochastic processes with 
continuous sample paths, gives rise to the question of how to define records 
in higher dimensions. We consider two different concepts: a simple record is 
meant to be an rv X(”) that is larger than X),...,X(—) in at least one 
component, whereas a complete record has to be larger than its predecessors 
in all components. In addition to this sequential approach, we say that a set 
of rvs X,...,X contains a champion if there is an index i € {1,...,n} 
with X® > XG, j Æ i. In this case, X is called the champion among 
XO RO, 


‘TERMINOLOGY 


Let X, X, X2) be iid rv in Rê with continuous df F. We call X™ a 
simple tecnd if x (n) £ mMaXj=1,....n—1 x , and we call it a complete record 
if X@ > maxjar n-1 XM (Figures 5.1-5.3). We further define 


etal 


t,(X):=P (x is a simple record) i 


Tn( X) := P (x is a complete record) i 


Fig. 5.1: Data set at time n — 1. 


By definition, the first observation X® is always a record; thus, we de- 
mand 7(X) = 2,(X) = 1. In the univariate case, where X, X®, X@),... 
are rvs on the real line, records are much easier to handle, and clearly 
Tn(X) = m (X) = 1/n: with probability one there is a single strictly largest 
observation among X®,..., X™ by the continuity of F, i.e., 


= r[U{x X > max xo) S > max x0). 
1<ifxj<n 1<ifj<n 
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x a simple record 


Fig. 5.2: Data set at time n: a simple record. 


_x a complete record 


Fig. 5.3: Data set at time n+ 1: a complete record. 


The iid assumption on X®,..., X) provides 


P(x > Hiss x) =P (xi > max x), j=1,...,n— 1. 


1<i<n—1 1<iFj<n 


These two equations together imply 


1 
P (x > max x0) =-. 
1< n 


<i<n-1 


There is much detailed work on univariate records; see, for example, 
Galambos (1987, Sections 6.2 and 6.3), Resnick (1987, Chapter 4), and Arnold 
et al. (1998). Results on the limiting distribution of joint records have been 
recently derived by Barakat and Abd Elgawad (2017) and Falk et al. (2018). 
Multivariate records have not been discussed that extensively, yet they have 
been approached by Goldie and Resnick (1989, 1995) and Arnold et al. (1998, 
Chapter 8), among others. For supplementary material on multivariate and 
functional records, we refer the reader to Zott (2016) and Dombry et al. (2018). 
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IT’S THE COPULA 

According to Sklar’s theorem 3.1.1, the df F has the representation 
F(x) = C(Fi(21),..., Fa(xa)), B= (Gieta); 


where C is a copula on R? and F,,..., Fy are the univariate margins of F. 
Therefore, we can assume the representation 


x = (A (uy) ee ya (us?) ae ee 


ue = (OP ee) i=1,2... 


where 


are iid rvs that follow the copula C. 
Recall that since F is continuous, the margins are continuous as well, and 
in this case, C is uniquely determined by 


C(u) =F (FI (u1), , Fy (ua)) ; u = (u1, ..., Ua) € (0,1)%. 
Being a record (or a champion) depends on U™® , not on the df F, if this 
is a continuous function. This is the message of the next lemma. 


Lemma 5.3.1 If the underlying df F is continuous, then we have with 
probability one for each n € N 


X) is a (simple/complete) record or a champion 


<= UČ) is a (simple/complete) record or a champion. 


Proof. We make use of the general equivalence 


Fol(u)<a = u < F(z), u € (0,1), £ ER, 


a 


or, equivalently, 


F(u) >x 4 u> F(a), u € (0,1), £z € R, 


au 


which can be established by elementary arguments. For a continuous df F;, we 
have F;(F;'(u)) = u, u € (0,1). It is also well known that the multivariate df 
F is continuous iff the univariate margins are; see, for example, Reiss (1989, 
Lemma 2.2.6). Thus, we obtain with probability one for i 4 k 


AM sae) BU) sau) 


= os B UP) =e. 


This proves the assertion. 
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CONCURRENCY OF EXTREMES AND CHAMPIONS 


A concept that is closely related to the field of complete records is the so- 
called concurrency of extremes, which is due to Dombry et al. (2017). We say 
that X,...,X are sample concurrent if 


max X® = xX) for some k € {1,...,n}. 


In that case, we call X) the champion among X),...,X(™. Note that, 
different from univariate iid observations X1,...,Xq with a continuous df F, 
there is not necessarily a champion for multivariate observations. 

We denote the sample concurrence probability by p,(X) and, due to the 
iid property, obtain as before 


U{xo> max xo) 
1<jži<n 


= nm, (X). (5.15) 


If the limit limpo Pn( X) exists in [0, 1], we call it the extremal concurrence 
probability. 

Different than records, the concept of multivariate and functional champi- 
ons is very recent. It has been established in the work of Dombry et al. (2017). 
In their paper, they derive the limit sample concurrence probability under iid 
rvs X®,..., X™ in R¢. There are also many results on statistical inference 
in their work. The D-norm approach provides an elegant formulation of their 
results; see below. 

According to the Lemma 5.3.1, we can assume wlog that the observed iid 
rvs follow a copula, say C. To emphasize this assumption, in what follows, we 
use the notation U instead of X. 


Theorem 5.3.2 Let U),U®),... be independent copies of the rv U, 
which follows a copula C on RÊ satisfying C € D(G), where G is an 
SMS df with corresponding D-norm ||:||p. Then, 


Pr(U) = nTa (U) Fns0 E (UND), 


where the ru n has df G. 


Proof. The condition C € D(G) implies 


M” :=n __max (u — 1) >D N. 


i=1,...,.n—-1 


218 5 Further Applications of D-Norms to Probability & Statistics 


Conditioning on M™ =a < 0 € R¢ yields 


nTa (U) = l en nP (n(U — 1) > x) (P * M™) (dæ) 


= fyr (Px mM) (dx) 


since M“™ and U are independent. Setting Yp := gn (M a), we need to 
show 
nTn (U) = E(Yn) >n» E (Hp). 


It is enough to verify (Billingsley (1968, p. 32)): 


(i) Yn >p UX. 
(ii) There exists £ > 0 with suppen E a 2a. 


Note that (ii) implies the uniform integrability of the sequence (Y;)nen. 
We first show (i). From Lemma 3.1.13 we obtain 


for £n, æ < 0 € R¢ with ||æn — 2||,, noo 0. Now noticing that M™ +p 
7, the assertion is immediate from the extended continuous mapping theorem; 
(Billingsley (1968, Theorem 5.5)). 

Now we prove part (ii). Elementary calculations show that, for all n > 2, 


E (YA = _ n?P (n(U —1) > x}? (P ‘ mM) (dx) 


< J n?P (n (U1 — 1) > z)? (P , m$) (dx) 
(—2,0] 


= <2, 
n+17 


which completes the proof of Theorem 5.3.2. 


The preceding result clearly implies an expansion of the expected number 
of complete records as the sample size n increases. 
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Corollary 5.3.3 Denote by R(n) := X; 1(U > maxi<j<; UO) 
the number of complete records among U“,...,U“). Then, under the 
conditions of Theorem 5.3.2, 


E(R(n)) 


O ns ntp) 


Proof. We have 


It is well known that ()>;"_, 1/i) /log(n) >n 1. The assertion follows from 
Theorem 5.3.2 and equation (5.15) together with elementary arguments. 


The following lemma provides an alternative representation of the extremal 
concurrence probability. 


Lemma 5.3.4 Let n = (s)seg be an SMS rv with corresponding D- 
norm ||-||p. Let Z = (Z1,...,Za) be a generator of ||-||p. Then, we 
have 


E MND) =E (aZ > 0)) 


Further, for x = (z1,..., £4) <0 E RË, we have 


E (Q max(n, x) Xp) 
255 (T (1 ae (111/21, ax (2:2) (Z> 0)) 


Proof. Wlog, we can choose a generator Z of ||-||,, that is independent of n. 
Then, by conditioning on 7, 


P (amiaza) = f eP (ERAZ) (Pate 
=| UL UD (P x n)(dæ) 
(—00,0]4 


=E(Wnp). 


Lemma 1.2.2, and the fact that 7 and Z are independent entail 
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E (sn, nl 20) = [er ( rai min (nil Zi) >t) dt 


P (nj < —t/Zi, 1<i<d) dt 


—t/zi, 1< i < d) (P x Z)(dz) dt 


pre 
E 


“ha T aT 1(z > 0) (P * Z)(dz) 


1 


=E (a > 0)) ; 


which is the first assertion. The second assertion can be shown by repeating 
the above arguments. 


Example 5.3.5 (Independence and Perfect Dependence) A 
generator of the special D-norm ||-||5> = Illo; which characterizes 
the complete dependence of the univariate margins of 7, is given by the 
constant Z = 1. In that case, Lemma 5.3.4 implies that the extremal 
concurrence probability is one, i.e., Pn(U) = nFp(U) >n>o 1. 

In contrast to that, we have 


il : 


In particular, this is the case when at least two components ni, Nj, 
i # j, are independent. This is due to the fact that the bivariate D- 
norm corresponding to (m, nj) is the bivariate logistic norm ||-||,, with 
generator (Z;, Zj). But l-l, = 0 and thus min(Z;, Z;) = 0 a.s. by (1.12) 
and Corollary 1.6.3. 


ASYMPTOTIC DISTRIBUTION OF COMPLETE RECORDS 


Having established the extremal concurrence probability, we can now derive 
the limit survival function of a complete record. We have to restrict ourselves 
to the case where P(Z > 0) > 0, which is equivalent to the condition that 
the extremal concurrence probability is positive; see (5.16). Just as before, we 
consider the copula case first. 
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Proposition 5.3.6 In addition to the assumptions of Theorem 5.8.2, 
suppose that the generator fulfills P(Z > 0) > 0. Then, for x <0 € R$, 


Hg) := P (n (u — 1) > x | U™ is a complete record) 


E (Q max(n, x) ùp) 


noo H := 
Parea Hole) En vp) 


és 


where n = (m)i<i<a is an SMS rv with corresponding D-norm ||-|| p- 


Note that we avoid division by zero in the preceding formula by the as- 
sumption P(Z > 0) > 0. 


Proof. We have 


- I,„(£) _ P (n(U — 1) > x,U > maxi=1,...n-1 U®) 
Tn P (U > MaXj=1....n—-1 U) 


parag 


According to Theorem 5.3.2, it remains to show that, for each x € R4, 
nil,(2) =nP (nu — 1) > max (2. mM) noo E N max(n, £) Xp), 


where M™ := n maxi=1,...n—1 (U® — 1). This can be done by repeating the 
arguments of the proof of Theorem 5.3.2. 


Another representation of Hp(æ) is given by 


B (garb (ex (1/2 maxisisa(e:2)) JZ > 0)) 


E ey > 0)) 
where Z is a generator of ||:||p. This is due to Lemma 5.3.4. 


Example 5.3.7 For the Marshall-Olkin D-norm 


b 


(5.17) 


Izl, =Allello + (1-A) lial, æ ERf, A€ (0,1), 


we obtain 


H, (x) = 1 — exp (Ito, max a) , z <0, 


which is the survival function of the max-stable rv (7,.--,7)/||1llp,; 
where y is standard negative exponentially distributed and ||1||p. = 
A +d(1— åA). Note that this rv has completely dependent components. 
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Although it is not common, we provide a proof of the preceding Exam- 
ple 5.3.7 as well. 


Proof. A generator of the Marshall-Olkin D-norm ||-||,, is given by 
Z:=€(1,...,1I)+(1-£)Z", 


where € is an rv with P(€é = 1) = A = 1 — P(€ = 0), and € is independent 
of Z*, which is a random permutation of the vector (d,0,...,0) with equal 
probability 1/d. Obviously, P(Z > 0,€ = 0) = 0. On the other hand, € = 1 
implies Z = 1. Thus, we obtain from (5.17), for all æ < 0 € RÎ, 


A(x) 


=]— 


geeay 


which completes the proof. 


SIMPLE RECORDS 


So far, we have investigated the (normalized) probability of a complete record 
and in particular, its limit, the extremal concurrence probability. Now, we re- 
peat this procedure, this time for the simple record probability. Unlike before, 
where we were actually dealing with the probability of having a champion, 
normalizing the record probability with the factor n does not yield an inter- 
pretation in terms of a probability in the simple record case. 

The following result is the equivalent of Theorem 5.3.2 and Proposi- 
tion 5.3.6 in the context of multivariate simple records. Let X,X,X@),... 
be iid rvs in R¢ with common continuous df F. Recall that X) is a simple 
record, if 

X™ ¢ max XO, 
1<i<n—1 
and z,,(X) denotes the probability of X(™) being a simple record within the 
iid sequence X®, X... 


Theorem 5.3.8 Let U“),U®),... be independent copies of an rv U € 
R? following a copula C. Suppose that C € D(G), G(x) = exp(— ||æl||p), 
x <O€R?. Letn be an rv with this df G. Then 


NEn (U) >n> E (lInll), 
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P(n(U™ — 1) < æ | U™ is a simple record) 


E(|min(æ, |p) = llel» 
E(\Inllp) 


nse Hale) := £<O0ER*. 


In the one-dimensional case d = 1, we obtain Hp(x) = exp(x), x < 0. Note, 
however, that Hp is not a probability df in general. For instance, take ||-||, = 
\|-||,, which is the largest D-norm. In this case, the components m,..., na of 
n are independent, and we obtain for Œ = (£1,..., £4) < 0 € RI 


m (e) = Leer Elmia n) = lee) _ Eh ele) 


EL E(\nil) d 


This is in general not a probability df on (—oo, 0] since, for example, Hı (æ) 
does not converge to zero if only one component x; converges to —oo. 

On the other hand, take |]-||,, = ||-||,,, which is the least D-norm. In this 
case, the components 7,..., Na of n are completely dependent, i.e., 7, = 72 = 
+++ = Na a.s.; thus, 


= exp(— æl) x = (x1,...,24) LO ERI, 
which is an SMS df according to Theorem 2.3.3. 


Proof (of Theorem 5.3.8). Let Z be a generator of ||:||p, independent of 7. 
Theorem 5.3.2, the inclusion—exclusion principle in Corollary 1.6.2 as well as 
Lemma 1.6.1 yield 


nx, (U) =nP (v £ max ue) 


<i<n-1 


= E((Inllp)- 
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Similarly, one can use Proposition 5.3.6 to show that, for x < 0 € R%, 
nP (n(U — 1) £ min(#, Mn)) noo E (||min(#,7)||p), 


where M,, := nmaxj<1,....n—-1(Un — 1) >p 1. 
From Proposition 3.1.5, we obtain for x < 0 € R? 


nP (U £=) =n(1-P(U<=)) = liælp +001) 


as n increases. 
In summary, we obtain 


nP (u <1+3, Ug% max U“ ) 


l<i<n-1 
= nP (n(U — 1) £ min (æ, M,)) —nP (u oes =) 


noo E (||min(#,7)I|_p) — lællp » 


which completes the proof of Theorem 5.3.8. 


In Corollary 5.3.3, we investigated the expected number of complete 
records as the sample size went to infinity. This can be done analogously 
for simple records. Its proof carries over. 


Corollary 5.3.9 Let X“,X@),... be iid rus in R with a continuous 
df F. Suppose that the copula corresponding to F is in the domain of 
attraction of G(x) = exp(— lelp), z<0eR?. 

Denote by m(n) := X; 1 (X® Z maxi<jc; X) the number of 
simple records among X“),...,X'™. Then, we have 


E(m(n)) 
log(n) 
where 7 follows the df G. 


>n> E (nllo) , 


The arguments in the proof of Theorem 5.3.8 can easily be repeated to 
extend it to the case of a general rv X € R?, whose df is in the domain of 
attraction of a max-stable df. Denote again using 


Cr(u) := F (F7 (w),..., F7 (ua)); u= (ta, ua) € [0,1], 


the copula of a continuous df F on Rf, where F; is the i-th univariate 
marginal df. 
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Corollary 5.3.10 Let X‘), X®),... be independent copies of an rv 
X € R?, whose df F is continuous and its copula Cp satisfies Cp € 
D(G), G(x) = exp(- ||æl|p), x < 0 € R*. We require in addition that 
each univariate margin F; of F is in the domain of attraction of a 
univariate maz-stable df Gi, i.e., there are constants ani > 0, bni E€ R, 
n € N, such that, fori =1,...,d, 


n(1 — F(anit + bni)) >n» — log(Gi(x)) =: —a; (2), G;(x) > 0. 
Then, with an = (Gni,---,@nda); bn = (bni,.--,0na) and (a) = 


(wh (a1), -.-,ta(Za)), © = (a1,---, 2a), Gi(a;) > 0, 1 = 1,...,d, we 
obtain 


<a|X™isa simple record ) noc Hply(zx)). 


Note that in the case d = 1 
Ap(p(x)) =exp(¥(x)) = G(x), G(x) >0. 


Note, moreover, that the assumptions on the df F in the preceding theorem 
are equivalent with the condition F € D(G), where G is a d-dimensional 
max-stable df, together with the condition that F is continuous; see Proposi- 
tion 3.1.10. 


Proof (of Corollary 5.3.10). Assume the representation 
X = (FT? (U1), --, Fg (Ua)), 


where U = (U4, ..., U4) follows the copula Crp of X. Repeating the arguments 
in the proof of Theorem 5.3.8 now implies the assertion. 


(SIMPLE) RECORD TIMES 


We denote using N (n), n > 1, the (simple) record times, i.e., those subsequent 
random indices at which a simple record occurs. Precisely, N(1) = 1, as X® 
is clearly a record, and, for n > 2, 


PE E ia (3) (4) 
N(n) = min (j: j>N(n-1), X o ae is 
As the df F is continuous, the distribution of N(n) does not depend on 
the univariate margins of F; therefore, we assume wlog in what follows that 
F is a copula C on Rf, i.e., each component of X is uniformly distributed 
on (0,1). To emphasize this fact, we use again the notation U = (U1,..., Ua) 
instead of X in what follows. 


226 5 Further Applications of D-Norms to Probability & Statistics 
EXPECTATION OF RECORD TIME 
Conditioning on U® = u, for j > 2 yields 
P(N(2) = j) = P(U® <u)... UI) < UV, UO ¢ U) 
= C(u)? (1 — C(u)) C(du). 
[0,1]2 


Solving the geometric series, we get 


= iP) =)= f F n C(du) +1. (5.18) 


Suppose now that d = 1. Then, we have u = u € [0,1], C(u) = u, and 


£ du + 2 = œ 
u 


which is well known (Galambos (1987, Theorem 6.2.1)). Because N(n) > 
N(2), n > 2, we have E(N (n)) = œ for n > 2 as well. 
Suppose next that d > 2 and that the margins of C are independent, i.e., 


d 
=| [v u = (u1,..., ua) € [0,1]. 
i=1 


Then, we obtain 


C i i 
1 ee w=. i a du ,...dug < œ 
po,ye1—C 0 1-]] f= 1 Ui 


using elementary arguments and, thus, E(.N(2)) < co. This observation gives 
rise to the problem of how to characterize those copulas C on [0,1]¢, with 
d > 2, such that E(.N(2)) is finite. Note that E(N (2)) = œ if the components 
of C are completely dependent. 


CHARACTERIZATION OF FINITE EXPECTATION 


The next result characterizes the case E(N(2)) < oo. It requires no further 
condition on the underlying copula C, i.e., we do not require C € D(G) for 
some SMS dfs G. Its proof only uses the Hoeffding—Fréchet bounds for a 
multivariate df (see, for example, Galambos (1987, Theorem 5.1.1)). 


5.3 Multivariate Records and Champions 227 
Proposition 5.3.11 We have E(N(2)) < œ iff 


1 P(U;> u, PUi2ulsisd) , 
u Co. 


(1-1)? 


Condition (5.19) is trivially satisfied in the case of independent components 
Ui,...,Uq and d > 2. Below, we see that it is roughly satisfied in general if 
there are at least two components that are tail independent. 


Proof. The Hoeffding—Fréchet bounds for a copula C are for u = (u1,..., Ua) € 
[0, 1° 


d 
max (1-44 3u.0) < C(u) < min (w,..., ua). (5.20) 


j=1 
Due to the upper bound in (5.20), we obtain from Lemma 1.2.2 


< | P(u>1-F.1<i<a) dt. 
1 


On the other hand, the lower bound in (5.20) yields 


E(N(2)) -1 a (cw) S1- 2) dt 


= [YP (i-u<disis ) a 


1 
; == <7 < š 
e, p(u>1 pisisd) d 


As a consequence, we have established the equivalence 


E(N(2)) < œ =f p(u>1-F.1<i<a) dt < ov. 
1 
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Substituting t > 1/(1 — t) yields 


= l „>u, 1<i< 
I P tsiera a= f PU 2ulsisd iu, 
1 t 0 (1—u)2 


which completes the proof of Proposition 5.3.11. 


INFINITE EXPECTATION OF RECORD TIME 


The next result provides a criterion for the case E(N(2)) = co in terms of 
D-norms. This requires the additional condition that the underlying copula 
is in the domain of attraction of an SMS df. 


Proposition 5.3.12 Suppose that C € D(G), where the D-norm cor- 


responding to G satisfies 21 Np > 0. Then E(N(2)) = œ. 


Proof. Let U = (Ui,...,Ua) be an rv that follows the copula C. From 
Lemma 3.1.13 and the homogeneity of the dual D-norm function X : wp, we 


obtain 
PU; > u, 1<i<d) 


uti V1 Ud. 
Iu ufl D 


As a consequence, there exists £ € (0,1) such that 


P(U: >u, 1<i<d) _ 1X 


l1—u ~ 2 
for u € [1 —e, 1). This implies 
t P(U;>u,1<i<d) 1 P(Uj>u,1<i<d) 
u> at SS dy, 
a) ee le (1 -— u)? 


1 ? l 
> v % f Fe ass, 
2 j--l-—u 


which completes the proof of Proposition 5.3.12. 


ANOTHER TAIL DEPENDENCE COEFFICIENT 


Suppose that C € D(G). According to Proposition 5.3.12, a finite expectation 
E(N(2)) < oo can only occur if the dual D-norm function satisfies l1 p = 0, 
which is true, for instance, if G has at least two independent margins. 

Let U follow the copula C. Next, we show that E(N(2)) is typically finite 
if U has at least two components U;, U; that are tail independent, i.e., 


lira PU >u|U; >u)=0. 


5.3 Multivariate Records and Champions 229 


Within the class of (bivariate) copulas that are tail independent, 


ee 2 log(1 — u) 7 
` ufi log(P(U1 > u, U2 > u)) 

is a popular measure of tail comparison, provided that this limit exists (Coles 
et al. (1999); Heffernan (2000)). In this case, we have ¥ € [—1,1] (Beirlant 
et al. (2004, (9.83))). For a bivariate normal copula with a coefficient of cor- 
relation p € (—1, 1), it is, for instance, well known that ¥ = p. 

Note that the next result does not require C € D(G). It requires only 
the existence of the above tail dependence coefficient for at least one pair of 
components. 


Proposition 5.3.13 Let U = (Ui,...,Ua) follow a copula C. Suppose 
that there exist indices k # j such that 


2 log(1 — u) 


lim ——>+————— — 1 € [-1, 1). 5.21 
utl log(P(U, > u, U; > u)) SEAN an 


Xk,j = 


Then, we have E(N(2)) < oo. 


Proof. According to Proposition 5.3.11, we have to show 


o 
0 


(uP du < oo. 


But, obviously, 


[ P(U; > u, eee aug [Pee 
yr: U. 
0 


(1 — u)? 1—u)? 
Therefore, we only have to find e € (0,1) such that 


E P(U, > u, Uj > u) 


m=i du < œ. 


Since 
2log(1 — u) 
log(P(U;, > u, U; > u)) 
there exist £ > 0 and c < 1/2 such that 


-1 —>uĵl Xk,j € [-1, 1), 


log(P(Ux > u, U; > u)) 
So EON NE eR Ee 1—e,1). 
2 log(1 — u) ai ie aet 


Taking logarithms yields 


1 P(Uņp > u, U; > u) 


d 
e =u} . 
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= L exp (is (Aa ee aos Z 2) du 


= f /( 2 log(1 — u) (1 ae )) i 


1 
< | exp(—2clog(1 — w)) du 
1 


—E 


I 1 
= —— du < oo, 
iz (l= u)?* 


as 2c < 1. This completes the proof of Proposition 5.3.13. 


Corollary 5.3.14 We have E(N(2)) < co for multivariate normal rvs, 
unless all components are completely dependent, precisely, unless all 


bivariate coefficients of correlation are one. 
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distribution), 102 
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Homogeneity, 1 
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Idempotent D-norm, 40,43, 47,49 
Identity element, 39 
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iid (independent and identically 
distributed), 7 

Inclusion-exclusion principle, 20 
Inner product, 78 
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random set, 87 
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Jensen’s inequality, 48 


Krein—Milman theorem 
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Linear affine functional, 67 
Locally convex vector space, 68 
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Random set, 86 
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simple, 214 
Record time, 225 
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infinite expectation, 228 
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variable), 4 
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expected number, 224 
Simplex, 68 
Sklar’s theorem, 136 
SMS (standard max-stable), 110 
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Spectral decomposition 
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Survival probability 

of standard max-stable rv, 122 
Symmetric root 

of positive definite matrix, 3 


Tail dependence, 120 
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